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ABSTRACT 



The Stylist is a Waterloo Pascal program that analyzes the style of English prose. 
.A. "style checker". The Stylist pertains to Computational Stylistics and Computer 
Assisted Composition Instruction (CACI). The Stylist creates an affective model of 
the te.xt based upon the following characteristics of its component words : etymology, 
tangibility, difficulty, emotional connotation and vigor. The Stylist then compares this 
model to the standards of fiction or nonfiction texts and reports results and 
recommendations to the user. 

The Stylist also creates a concordance of the user's input text using a new data 
structure called a Concordance Search Tree (CST). A CST is a binary search tree with 
a linked list threaded through it recording the order of the use of each word. An 
inorder traversal of the tree, with a traversal of the linked list during each visit, creates 
a concordance. 

This thesis also review's related literature and programs. 
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I. INTRODUCTION 



A. BACKGROUND 

1 . Computational Stylistics 

Computational Stylistics is the computer-assisted study of literary style. 
Computational Linguistics is the computer-assisted study of language itself These 
disciplines are the automated subsets of Statistical (or Quantitative) Stylistics and 
Linguistics, which are the mathematical studies of style and language. All of these 
disciplines involve quantifying aspects of language and then manipulating these 
quantities in an attempt to gain insight into how and why the language works. 

The history of stylistics can be traced back to 1851, when Augustus de 
Morgan suggested that word-length could prove to be a distinctive trait of a writer's 
style. [Ref 1] This suggestion prompted T.C. Mendenhall, an American geophysicist, 
to investigate whether mean word-length could resolve authorship problems such as 
those posed by some of the disputed Shakespeare plays or the letters written under the 
pen name of Junius. Working in the late 19th century', Mendenhall analyzed the word- 
lengths of some two million words from various periods of English literature, using a 
primitive tabulating dev’ice that spit out reels of paper. His results, however, proved 
little. 

In the early 20th century', another possible characteristic of style, the 
frequency distribution of words, came under wide-spread investigation. G.K. Zipf 
postulated a "Rank-Frequency Law", by which a ranking of the use of words in a text 
would show a constant decrease from the most-used word down to the least-used word. 
[Ref 2] Other scholars, such as G. U. Yule, investigated such aspects as the richness of 
a writer's vocabulary and the length of sentences. [Ref 3: pp. 363 - 390] By the 1950s 
and 1960s, the application of statistical methods to the study of literature had reached 
new heights of sophistication and complexity. Some scholars were investigating the 
significance of the ratio of verbs to adjectives and others were applying rigorous 
statistical techniques. Despite the many fascinating insights offered by these lines of 
inquiry', their fmal significance and scientific credibility remained a question of much 
dispute. Enemies of the new disciplines included Norm Chomsky, the pioneer of 
formal languages, who argued that writing is a very human activity which involves a 
great deal of chance, and as such is not easily quantified. [Ref 4] 
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Computational stylistics began as a natural outgrowth of statistical stylistics. 
With the spread of computer access some thirty years ago, some scholars quickly 
realized the potential for their use in literary studies: 

"I first heard of computers in 1955 when my wife (a mathematician) told me that 
she was going to work for an oil company as a program analyst ... It took me a 
while to understand her flowcharts and computer programs, but w'hen I did. 1 
realized that a computer could be used to solve other problems as well. Since 
then, 1 have used a computer for numerous applications relating to my w'ork in 
the English Department of Cleveland State University, and have taught many 
others ... the power of electronic data processing in the study of language and 
literature." [Ref. 5] 

The first challenge to the field of Computational Stylistics was to translate 
literary texts to electronic data. Much of the early literature of the field is devoted to 
this basic problem. .A,fter the first two decades, corpora, or bodies of literary’ texts in 
machine readable form, had been developed. The Lancaster-Oslo/ Bergen (LOB) 
Corpus is a structured collection of 500 two-thousand-word texts of written British 
English, .-kn .A.merican version of this effort is the Brown Corpus, constructed by 
Brown University.^ 

In addition, many literary' classics have been painstakingly entered by 
keyboard. They are available from the Oxford Computing Laboratory at Oxford 
University and the American Philological Association. [Ref 5] With the increasing 
capability and decreasing cost of optical readers, the problem of translating literary’ text 
to electronic data should become trivial. 

The usefulness of computers to statistical stylistics quickly became 
indisputable. Scholars used to labor to copy down on 3" X 5" cards each occurrence of 
every noun used in a Shakespeare play and then sort these cards by hand. [Ref 6: pp. 
33 - 50] Now a computer program could manipulate the text in any number of ways 
w’ithin minutes. One of the other prominent successes of the use of computers in 
literary studies w’as the automation of the process of forming concordances. .A 
concordance is an alphabetical listing of all the significant words used in a text, 
together with the enclosing phrase. Before the advent of computers, scholars 
laboriously built concordances of the Bible and the plays of Shakespeare. The power 
of the computer made possible giant concordance-building projects such as one for the 

^Electronic forms of both of these corpora are available from: The Norwegian 
Computational Center for Humanistic Research, P.O. Box 53, University of Bergen 
N-5014, Bergen, Norway 
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Dead Sea Scrolls. The Centro Atomazione Analisi Linguista in Gallarate, Italy, used 
computers to build this concordance, which helped to resolve some of the missing or 
obliterated words. [Ref. 7] 

One of the next important challenges to computational linguistics was the 
issue of disambiguation. Many words have more than one meaning and can belong to 
more than one parts of speech. An e.xample in English is the word "flies" in the 
following two sentences: 

Time flies like an arrow. 

Fruit flies like an apple. 



A human reader can easily decide that "flies" in the first sentence is a verb and 
that "flies" in the second sentence is a noun. This decision is disambiguation. In order 
to compute such characteristics as verb-noun ratio, it is first necessary to disambiguate 
the parts of speech. Recent advances in computational linguistics have led to 
programs that can do this with more than 90% success. [Ref 8] [Ref 9: pp. 139 - 150] 
[Ref 10] 

Scholars continue to argue about the usefulness of statistical stylistics. Effons 
to discover and prove laws of distribution have not met with unchallenged success. 
Nevertheless, the work done in the past 130 years has laid some groundwork for the 
use of computers to study and analyze written prose. All of the professorial attempts 
thus have led to success at a humbler level : the tutorial. The history of stylistics is 
now culminating at this level, in the form of Computer Assisted Composition 
Instruction. 

2. Computer Assisted Composition Instruction (CACl) 

CACI is a new discipline which has begun to emerge in the past decade. The 
success of word processing programs created a growing population of people who 
expected computers to help them to write. Enterprises such Bell Laboratories and 
centers of learning such as The Pennsylvania State University began to borrow from 
the field of stylistics as they developed computer programs that would help students 
and workers plan, write and edit prose. Several universities, such as Colorado State 
University, successfully instituted CACI as part of their composition curriculum. 
[Ref 11] The earliest CACI programs required a mainframe or at least a minicomputer. 
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With the expansion of the power of personal computers, however, software houses 
such as DecisionWare Inc. began to v,Tite CACI programs for that environment. Over 
10,000 copies of DecisionWare's RightWriter are now in use at corporations, agencies 
and universities around the world, according to their advertisements. Given that word 
processing remains the single most common use of personal computers, as one 
megabyte of main storage becomes commonplace on new personal computers, style 
checkers such as RightWriter may soon become as popular as spell checkers have 
become in the past five years. 

B. SCOPE OF THE STYLIST 

The Stylist is a style checker akin to Bell Laborator>''s Writers Workbench and 
DecisionWare Inc.'s RightWriter. It does not help the user to plan or to compose his 
product. Instead, it analyzes the finished product and provides that analysis to the 
user for his consideration toward revision. The Stylist does not determine parts of 
speech, as does Writer's Workbench; nor does it suggest alternatives to hackneyed 
phrases, as does RightWriter. What the Stylist does instead is to analyze some of the 
characteristics of the words used in the text. By doing so. The Stylist can distinguish 
between short, vigorous, germanic, emotional texts and long, lazy latin texts. The 
success of this effort could and should be incorporated into more extensive programs 
such as Writer's Workbench. 

In addition. The Stylist solves the traditional problem of creating a concordance 
in a novel and elegant way. This solution is optimal for the personal computer 
environment of the coming few years. 
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II. RESEARCH FOR THE STYLIST 



A. LITERATURE REVIEW 

When I first conceived of The Stylist. I believed that a "style checker" was a 
completely original idea. Little did I know that major universities and great 
coportations had been working on the problem for decades. My research began with 
the Encyclopedia Brittanica, where I discovered the e.xistance of the fields of stylistics 
and statistical linguistics. Using these as subjects, I searched the Dewey decimal 
system and several automated data bases for titles. The books I discovered referenced 
the two journals which publish many of the pertinent articles : Computational 
Linguistics, and Computers and the Humanities. These, in turn, carried advertisements 
for some of the currently available software. They also identified the current centers of 
learning, some of which I contacted for guidance and information. The experience of 
researching the field demoted me from its inventor to its lowhest tyro. It also subjected 
me to many ideas, some of which I incorporated into The Stylist. The bibliography 
lists the sources which I unearthed which may benefit other students. In this 
discussion, 1 will cover only those articles which directly Influenced the development of 
The Stylist. 

1. Affective Tone 

C.W. Anderson and G.E. .McMaster reported on a program called PSA 
(Psychological Semantic Analysis) which analyzes the emotional tone of a text. 
(Ref 12] They built upon the work of D.R. Heise, who ascribed values of "Evaluation, 
Activity and Potency" to the thousand most commonly-used words in the English 
language. [Ref 13] PSA allows the user to enter his text one line at a time; if any word 
is ambiguous, PSA asks the user to disambiguate. PSA then matches these words to 
the 1000 Heise-word dictionarv’, adding values up to create a profile of the user's text. 
Table 1 helps clarify the meanings of the three Heise-word categories. 

After building and testing PSA, Anderson and McMaster concluded that: 
"The affective tone of whole passages can be measured by computer-assisted collection 
of . . . scores of those words ... for which Heise (1965) has provided semantic 
meanings." (Ref 12] 
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TABLE 1 

HEISE WORDS VALUES 



EVALUATION 



High 


Neutral 


Low 


church 

God 

beauty 

pleasant 


experiment 

prove 

mountain 

suggest 


war 

fire 

disease 

bad 




ACTIVITY 




High 


Neutral 


Low 


fire 

great 

attack 

fight 


well 

know 

last 

presence 


dead 
si lent 
sleep 
rock 




POTENCY 




High 


Neutral 


Low 


steel 

iron 

rock 

hard 


hear 

rich 

all 

indicate 


love 

kiss 

baby 

wife 



The original idea of The Stylist envisioned doing just such a count of words 
chosen for their frequency of use and their characteristics such as etymology, emotional 
connotation and vigor. Anderson and McMaster convinced me that this method had a 
sound psychological base. Moreoever, Heise's categories of Evaluation and Activity 
seemed to correspond directly to my envisioned categories of emotional connotation 
and vigor. His category of Potency, however, struck me and continues to strike me as 
distressingly Freudian. Because The Stylist would not use that category and because it 
would use others, I decided not to borrow Heise's words and values, but rather to build 
my own dictionaiy and ascribe my own values. This seemed a reasonable approach, 
given the statement by Anderson and McMasters that "there is much commonality in 
the emotional response of different persons to words and objects." 

2. Readability and Sentence and Word Length 

Two of the most common stylistic measures are sentence length and word 
length. The most common use of these measures is to determine readability. The basic 
idea is that short sentences are easy to read and long sentences are hard to read. 
Various formulae, such as the Kincaid, the Automated Readability Index (ARI), the 
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Colcman-l.iau, and the Flcsch Reading Ease Score, (Ref. 10] attempt to determine 
readability as a straight-forward function of sentence and word length. These formulae 
have had considerable impact on the teaching of English. Most style checkers use 
them. Some, like RightWritcr, admonishes the writer any time a sentence grows 
beyond 22 words or so. Having read a great deal of Victorian novels and Madison 
Avenue copy. I'm well aware that the ideal 20th century sentence is short. It's 
vigorous. Easy to understand. Maybe even dispenses with its verb as it rushes tow'ard 
its punctuation! The mechani/tation of this modern tendency into simplistic formulae, 
however, seems to me pernicious. .A sentence should be the unit of a complete idea. If 
we limit ourselves to short sentences, we may be limiting ourselves to small ideas. The 
average sentence of the prose of the Age of Reason was do words. If Samuel Johnson 
were to live today, no one would let him finish a sentence. My distrust of these 
formulae redoubled when I read "Readability is a Four-l.etter Word," by Jack Selzer. 
(Ref 14] In this article, Selzer forcefully makes the points that readability is a 
subjective quality which is greatly influenced by factors such as arrangement of ideas, 
reader background and interest, and dilTiculty of vocabulary’. The Stylist does not use 
readability formulae. It does Hag sentences which seem to be run-on. More 
importantly, it gauges the dilliculty of the vocabulary. If the writer uses hard words, 
long words, latinate words and long sentences. The Stylist warns him, particularly if the 
work IS fiction. 

B. PRODUCT TESTING 
I. Right Writer 

As part of my research, I procured a copy of Right Writer* and tested it with 
some of my own writing. Overall, the program impressed me with its capabilities and 
its engineering. RightWritcr reproduces the users text with inserted comments. Its 
constant challenge of long or complex sentences forced me to rec.xamine each case in 
particular. Several lengthy sentences became two short ones. It never let me begin a 
sentence with "But," a foible of mine. It applauded my writing when it was strong and 
it derided it when it was pompous. One attractive feature was the production of a 
alphabetical word list with frequency of occurrence. This feature demonstrated my 
overfondness for the words, "ancient" and "thousand". RightWritcr also suggested 

-RightWritcr (tin) is a product of Decisionwarc Inc., 2033 Wood St., Suite 218, 
Sarasota, I E 33577, (813) 952-921 1. 
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substitutes for hackneyed or useless plirases such as "the fact that". Overall, 
RightWriter proved itself extensively useful, 

2. PC-Style 

PC-St\le^ is a much less ambitious program than RightWriter. PC-Style 
required only 40K of RAM, compared to 192K for RightWriter. PC-Style is also much 
cheaper, costing only S29.95 as compared to RighiWriter's S95. I tested it with the 
same test data set I had used on RightWriter. PC-Stylc has a nice human-factor 
feature, in that it constantly displays to the user how much it has done and how far it 
is from finishing. RightWriter is more a colPce-break program : you execute it and 
then you go make some colfee. PC-Style, however, had little to recommend itself 
besides this feature. It relies upon a readability formula. It also attempts some 
affective modeling, based upon its dictionary of 50 action verbs. This miniscule 
dictionary is inadequate lor the task. A match of less than two percent of my input 
text with these 50 words was typical; such a small sample is inadequate to qualify the 
vigor of a passage. Another simplistic but more valuable tactic of PC-Style was to 
count the frequency of "personal" words, such as "1", "you" and "we". Although the 
conventions of the more stufl'y forms of writing forbid them, it is generally accepted 
that most technical writing benefits from direct, personal pronouns. PC-Style 
reinforces the clarity and forccfulness of direct rhetoric. Despite these few nice 
features, PC-Style is a too simplistic to be of lasting utility. 

3. Writer's Workbench 

.Although I was unable to experience Writer's Workbench, I did obtain 
enough research materials to form an impression of its utility. (Ref 10) Writer's 
Workbench'* is actually a complex of 32 programs. Together, these programs provide 
more than all the Icatures of RightWriter. Sl YLE calculates readability, using the 
preciously-discussed Ibrmulae. It also analyzes sentence type (simple, complex or 
compound). S'lYI.O is able to disambiguate the words of the input text with 95°/o 
accuracy. It then analyzes the use of verbs and modifiers. If a passage relies too 
heavily on the passive voice or it is fat with modifiers, STYLE warns the user. The 
follow-on program. DlCflON, detects hackneyed phrases. SUGGEST suggests 
replacements. Overall, Writer's Workbench appears to be the industry standard. 



^PC-Style (tin) is a product of ButtonWare Inc,, P.O. Box 5786, Bellevue, WA 
98006, (206) 454-0479. 

■* Writer's Workbench (tm) is a product of Bell Laboratories, Murray Hill, New 
Jersy, 07974. 
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C CONCLUSIONS BASED UPON RESEARCH 

Now I knew that I would not be writing the first style checker. Nor would I be 
wTiting the second, third, fourth or fifth. Some solace could be found in the fact that 
the previous programs relieved me of the need to incorporate all stylistic features into 
The Stylist. I could extend the affective modeling of PS.A by including other values of 
words. Three of these new values could be more objective-etymology, difficulty and 
tangibility. Together with vigor and emotional connotation, these values should be 
able to describe a profile of the user's input text. 

Additionally, I could increase the accuracy of such style checkers by pointing out 
the need to make allowances for the genre of the user's input text. Any reader of 
technical as well as fictional writing knows that the characteristics of these two types 
often vary more widely than the style of writers within the type. Comparative style 
analysis, therefore, would clearly seem to need to take the genre into account. 
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III. DESIGN OF THE STYLIST 



A. CHARACTERISTICS OF WORDS 

The fundamental idea of The Stylist is that individual words have power. The 
denotation of a word is its meaning. The connotation of a word is its emotional 
impact. For e.xample, "pupil", "student", and "scholar" all denote a person who studies 
and learns. The emotional connotations, however, range from the humble "pupil" 
through the familiar "student" to the lofty "scholar". Besides these emotional 
connotations, words have other intrinsic values which can be quantified. The Stylist 
would concentrate on the following values: Etymology, Tangibility, DifEculty, 
Emotional Connotation and Vigor. 

1. Etymology 

One of the beauties of the English language is that its vocabulary embraces 
two main sources: native and borrowed. Our native words mainly come to us from the 
Anglo-Saxon tongues. Borrowed words come from Latin, mostly by way of the French 
of the Norman conquest. The following table illustrates the differences : 



TABLE 2 

EX.AMPLES OF NATIVE AND BORROWED WORDS 



Native Words 



Borrowed Words 



Man 

Finger 

Thinker 

Fire 

Book 

Dirtbag 



Person 

Digit 

Philosopher 

Conflagration 

Volume 

Miscreant 



Native words are short, strong and rough. Borrowed words tend toward 
length, gentility and elegance. Good English prose (particularly good fiction) favors 
native words. Poor English prose (particularly bad technical writing) exhibits a 
tendency to overutilize latinate etymology. By counting up the number of native and 
borrowed words in a user's input text. The Stylist could see how it compares to good 
writing of the appropriate genre. 
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2. Tangibility 

A word can either evoke an image of a thing or it can refer to an idea. I call 
the former, "tangible" and the latter, "intangible". The following table illustrates the 
difference: 



TABLE 3 

EXAMPLES OF TANGIBLE AND INTANGIBLE WORDS 


TANGIBLE 


INTANGIBLE 


Rock 


Ethereal 


Lips 


Automatically 


Beehive 


Then 


Corvette 


Rely 


Hammer 


Preliminary 



Tangible words are concrete, exact and evocative. Intangible words are ideal, 
general and cognitive. All writing uses both. Good writing usually takes advantage of 
tangible words. Even the most philosophical writing benefits from the use of tangible 
words. (See Appendix E for an analysis of a Platonic dialogue). Good fiction writing 
rarely strays too far toward the intangible. The Stylist could count the tangible and 
intangible words of the user's input text and compare this count to good writing of the 
appropriate genre. 

3. Difficulty 

This is one of the most indisputable characteristics of words. For the 
purposes of The Stylist, I defined four levels of difficulty : Elementar\’, High School, 
Graduate and Postgraduate. The words of each level are those most likely to be used 
in speech with ease by an average person of that educational level. Although I'm 
aware that reading, writing and speaking vocabularies are different, I contend that a 
person reading a word within his reading vocabulary but outside of his speaking 
vocabulary often must pause for a mental translation into more simple terms. The act 
of this pausing raises the difficulty level of the text. Table 4 illustrates the categories. 
These examples are based upon my judgements. 

By counting the occurrence of difficult words. The Stylist could determine the 
overall difficulty of the vocabulary^ and thus the readibility of the text. 
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TABLE 4 




EXAMPLES OF WORDS OF VARIOUS DIFFICULTY 


ELEMENTARY 


HIGH SCHOOL 


GRADUATE 


POSTGRADUATE 


Big 

Sister 

Spoon 

Flying 

Wish 

Done 

Handsome 


Tardy 

Rendezvous 

Transmission 

Foreman 

Process 

Joyous 

Unaergo 


Matrix 

Immaterial 

Tonality 

Universal 

Induction 

Processor 

Linear 


Execrable 

Parsimony 

Recursive 

Homomorph 

Dilatory 

Zygote 

Synergy 



4. Emotional Connotations 

The Stylist would have five categories of emotional connotation : Sublime, 
Pleasant, Neutral, Unpleasant and Horrid. The following table illustrates the 
categories; 



1 




TABLE 5 






1 EXAMPLES OF WORDS OF VARIOUS EMOTIONAL CONNOTATIONS 

1 


1 SUBLIME 


PLEASANT 


NEUTRAL 


UNPLEASANT 


HORRID 


; Beauty 
Sunrise 
Victory 
Love 
God 

Paradise 


Hapoy 

Food 

Friendly 

Warm 

Helpful 

Sex 


The 

which 

Brick 

Is 

Name 

When 


Damage 

Loss 

Insulting 

Loser 

Cost 

Wound 


Cancer 

Murder 

Whore 

Fuck 

Death 

Traitor 



By counting the use of these types of words, The Stylist could determine the overall 
emotional tone of the passage. It could also detect flat or emotional writing. 

5. Vigor 

A related but distinct characteristic is vigor. SubUme and horrid words tend to 
be highly vigorous, but not all highly vigorous words are emotional. Examples of 
vigorous but unemotional words are "sprint", "rush" and "cross". This category also 
tends to be more objective. For example, the word, "soldier" can have widely different 
connotations for different people. The word "soldier" would please a career Army 
officer but it would displease a survivor of the Japanese occupation of Canton. Both 
could agree that "sohder" is a vigorous word. 
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TABLE 6 

EXAMPLES OF WORDS OF VARIOUS VIGOR 



VIOLENT 



ENERGETIC 



CALM 



INERT 



Destroy 



Creation 

Fire 



Sprint 

Dive 



Read 

Write 



From 

Into 



Atomic 



Discipline 



Wedding 

Steam 

Flying 



Manager 



Ocean 

Blue 

Sought 



Comma 

Paper 

Format 



Something 



Holocaust 

Conqueror 



By calculating the vigor of the words of the user's input text, The Stylist 
would be able to estimate its overall strength. 

B. THE NEED FOR A DICTIONARY 

To quantify the above five aspects of the words of the user's input text, it would 
be necessary to maintain a dictionaiy of the most commonly used words and their 
values. The original idea of The Stylist forsaw just such a dictionary. After reading 
.Anderson [Ref 12] and testing PC-Style, the idea seemed less original but still valid. 
The heart of The Stylist, therefore, would be its dictionary'. Before 1 began any top- 
down designing, I first wanted to explore the technical challenges and possible pitfalls 
of building such a dictionarv’. A review of data structure literature convinced me that 
two approaches were the most feasible; hashing and Binaiy Search Tree. Since 1 had 
already decided that The Stylist would also produce a concordance, I had to take into 
account the need for efficient alphabetical traversal of the words of the user's input 
text. My intuition that hashing would be preferable for dictionary look-up but would 
not lend itself easily to concordance-building was confirmed by a passage in an 
excellent text by Donald E. Knuth. [Ref 15: p. 540] Having decided to implement the 
dictionaiy with a Binary Search Tree (BST), I began to build a series of prototypes. 

C. THE COPYDIX PROTOTYPES 

CopyDixl through CopyDix6 were early experiments in using a Binary Search 
Trees for dictionary lookup. Naturally, the word was the key value. The experience of 
the CopyDix series taught me the followng lessons; 
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1. The Benefit of Preorder Storage 

The structure of a Binar>‘ Search Tree is maintained during execution as a 
system of pointers. These pointers refer to locations in memory which pertain only to 
that execution. If a dictionary' is modified during e.xecution (for e.xample, if new words 
are added), then the BST should be stored to its file in preorder. An inorder storage 
seems the most logical way but it is actually the worst. If a BST is stored using an 
inorder traversal, the next time it is loaded it will be that least bushy of all BSTs; a 
linked list. Storage using preorder will cause the next loading of the tree to duplicate 
the last. 

2. The Need for AVL 

Even with preorder storage, the dictionary needed the capability to change; to 
grow or shrink as the user desired. Any particular office or curriculum has its own 
special vocabulary. I wanted to give The Stylist the ability to adapt its dictionary to 
the vocabulary of its user's environment. Without some mechanism for rebalancing 
the BST after the insertion of new words or the deletion of unwanted words, the BST 
could become increasingly lopsided. Searching for words in such a lopsided tree would 
become inefficient. Clearly, an AVL scheme was required. I adapted an AVL insertion 
routine from an excellent te.xt by Niklaus Wirth. [Ref 16; pps. 220 - 2211 

3. Storage requirements 

The CopyDix series allowed me to see whether a BST of 5000 words and 
associated values would fit into the main storage available to a healthy personal 
computer. I determined that 1.5 megabytes would be sufficient for The Stylist and its 
data structures. The Copydix series allowed me to create such a structure and to prove 
that it would not require more than one megabyte. 

4. Text Processing 

The CopyDix series also identified some unforseen comple.xities in processing 
text. For example, in order to include contractions, it was necessary' to include the 
single quotation mark (') in the dictionary. Waterloo Pascal uses this character to 
define the beginning and end of user-defmed constants and strings. To include 
contractions, it was necessary' to declare "Succ('@')" instead of (')! There were a few 
other similar problems requiring equally unhappy solutions. 

5. Building a Dictionary 

Finally, the CopyDix series allowed me to begin to build a dictionary of most 
frequently used words. I ran the te.xt of two novels and several technical articles 
through the Copydix series, accumulating a dictionary' of over 2000 words. 



D. CONCORDANCE SEARCH TREE (CST) 

1. Original Concept 

Based upon the experience of the CopyDix series, I began to play around with 
the various ways of building a concordance. The literature provided two examples of 
how the problem has already been solved. One solution was to create a search tree. 
The key value of the search tree was the word; two associated tables, TOKEN and 
TYPE, recorded the information about the sequence of occurrence of the words. 
[Ref 7: pp. 186 - 191] Another solution familiar to most computer scientists was the 
Key Words in Context (KWIC) program designed by Parnas. [Ref 17] This solution 
creates a KWIC listing by circular shifting each line and then sorting each line; such a 
KWIC listing is similar to a concordance. 

I wanted to create a concordance without relying upon any external storage. 
My experience with the CopyDix series had shown that text processing is both 1, 0 and 
computation bound: 1,0 bound because large text files must be loaded; computation 
bound because each word must be processed character by character. I wanted the 
solution to eliminate the need for further input or output (such as using secondary 
storage to build files containing KWIC lines). I also wanted to minimize computation 
by allowing The Stylist to remember the order of occurrence of each word of the input 
text, without having to recompute it. I wanted my solution to flow as naturally as 
possible from the process of looking each word up in the dictionary. I didn't want to 
have to process each word twice, or to search for the position of any word twice. 
Ideally, the Stylist would have enough main storage available that it could retain all 
important information, without resorting to recomputation or to secondaiy storage. 
These ideas led me play to around with various ways of using a BST to record the 
order of occurrence of the words of the user's input text. One obvious solution would 
be to have each node of the BST contain pointers to associated data structures such as 
an array of linked lists containing the sentences of the input text. Such a solution, 
how'ever, would require storing the words twice. CopyDix had shown that words take 
up a great deal of storage space. Storing them twice would be wasteful. With these 
ideas in mind, I lit upon the idea of the Concordance Search Tree (CST). 

A CST is a Binary Search Tree with a linked list threaded through it. This 
linked list is two-way, connecting each word of the BST to the word used before it and 
to the word used after it. An inorder traversal of the BST visits each word in 
alphabetical order. During each visit, a traversal of the linked list in the "backwards" 
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direction would encounter all the words used before that word. Printing these words, 
then printing the key word being visited, and finally printing the words in the 
"forwards" 'direction creates a concordance. 

This solution offers some exciting facilities. It allows the user's input text to 
be accessed in any number of ways. If a concordance of three, five, seven or thirteen 
words per line is desired, changing a few global constants immediately fulfills that 
desire. If only the sentences containing a certain word are desired, a insert search of 
the CST keying on that word, and then a concordance traversal on that node, provides 
all those sentences. In short, a CST is a complex structure with a challenging but 
strong intuitive appeal. It provides a great deal of flexibility to the processing of text. 

One added wTinkle of complexity lies in the fact that many words of the input 
text occur more than once. The word, "the", for example, occurs many times. To 
guarantee a correct concordance traversal of the CST, each occurrence of the word 
"the" must be associated with a unique pointer. This requirement is satisfied by 
creating a unique node of pointers associated with each occurrence of the word. The 
main node of the CST contains the word, its values, and left and right pointers. It also 
contains a pointer called "down", which points to the linked list of unique nodes of 
pointers. The first such node pertains to the first occurrence of the word, "the", in the 
user's input text. It points to the word used before and the word used after that 
occurrence of the word "the". (These pointers are called "last" and "next".) It also 
points "down" to the second occurrence of the word "the". So the Stylist creates a 
concordance through the following traversals: an inorder traversal of the CST visits 
each main node in alphabetical order. During that visit. The Stylist goes down the 
linked list of the occurrences of that word. During each step down the linked list. The 
Stylist traverses "lastward" and then "nextward" to print out the line of that 
occurrence. When The Stylist reaches the bottom of the downward list, it continues 
the inorder traversal. 

2. The Tril - Tri9 Prototypes 

To examine the feasibility of the CST concept. I wrote a series of progressively 
more capable prototypes called Tril through Tri9. (The names refer to a "Trinaiy 
Search Tree".) 1 found that I could implement all the procedures necessary to build 
and traverse a CST in only 137 lines of code. Moreover, the performance of the 
concept, both in terms of main storage and processor requirements, seemed extremely 
satisfactory. 
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E. DESIGN OF THE STYLIST 

Having used a series of prototypes to identify and solve the critical issues, I set 
them aside and began to design The Stylist. After jotting down some logic flowcharts 
and data flow diagrams, I decomposed the problem into three main modules which I 
called "Reader", "Researcher" and "Reporter". 

1. The Reader Module 

Only Reader has access to the user's input text. It reads that text and passes 
along a data structure called "readnode", which is merely the word and its length. The 
Reader also signals the ends of sentences and the end of the file. 

2. The Researcher Module 

Researcher receives the "readnodes", researches the qualities of that word, 
builds a profile of the user's input text based upon the tally of the qualities of those 
words, and then passes that profile on to the Reporter Module. If the user wants one. 
Researcher also creates the concordance. Only Researcher has access to the dictionary 
file and to the CST. Neither Reader nor Reporter know how the dictionary is 
implemented or how the concordance is produced. This is the most crucial instance of 
information hiding in the design. 

3. The Reporter Module 

Reporter receives the profile, which it compares to the profiles of other texts. 
Based upon this comparison, it passes on recommendations and commendations to the 
user. 

4. Low Level Design 

Having decomposed the problem into these three modules, I jotted down a 
plan for the procedures that would comprise them. This plan included the 
input 'output and operations of each procedure. This low level design included all the 
parameters of the procedures and identified the hierarchy between the procedures and 
modules. 

One of the important features built into this low level design was the human- 
factor facility of reporting to the user the progress of the execution of The Stylist. The 
need to send messages to the screen telling the user what was happening required 
counting steps and further input/output, but my experiences with RightWriter, PC- 
Style and the CopyDix prototypes had convinced me that the expense was worth it. 
Psychologically, staring at a blank screen for five minutes seems ten times as long as 
reading a dynamic screen for seven minutes. To provide the user the option of quicker 
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but more boring execution over slower but more engaging execution, I designed a 
facilitv that asked him w'hether he wanted frecjuent reports, occasional reports or one 
report when done. 

With this low level design in hand, I was ready to begin to code. 
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IV. IMPLEMENTATION OF THE STYLIST 



A. CODING 

Because of the experience gained from the CopyDLx prototypes and because of 
the top-down, modular design, coding the Reader and Researcher modules took, only 
two weeks. Prototyping and top-down design allowed me to code quickly and with few 
errors. (The usefulness of these software engineering techniques would also prove itself 
in the testing phase.) As I had the CopyDix series, I implemented The Stylist in a 
series of iterative steps called Style I through Style9. 

Styled brought to light light an unforseen problem: the interaction between the 
facilities of adding new words to the dictionary and creating a concordance. The 
design called for allowing the user to decide whether he wanted to add new words to 
the dictionary and whether he wanted a concordance. These two choices created four 
cases: Growing Dictionary and Concordance; Growing Dictionary and no 
Concordance; Static Dictionary and Concordance; Static Dictionary and no 
Concordance. In the case of Static Dictionary and Concordance, to create a 
concordance, new words found in the user's input text would have to be added to the 
CST. But since the user had chosen not to add new words to the dictionary. The 
Stylist could not prompt him for the values of these new words. This was not a 
problem until the same new word was used again. Researcher would find this new 
word in the CST but would not find any values for the word. This would cause a fatal 
error: "Word. etymology has unassigned value." To avoid this problem, it was 
necessary to add a new field to each of the nodes: Status. The status of a node could 
be: "Valuable", which meant that the node contained values for the other fields; 
"Not_Valuable", which meant it didn't. 

This particular problem grew even more gnarly when I decided to extend the 
Look_Up procedure of Researcher. The orginal design called for Look_Up to try to 
match the input word only with its exact equal in the dictionary. This worked fine, but 
it meant that the dictionary would have to contain all the following variants of "look" : 
"look", "looks", "looked", and "looking". The extension created a new procedure called 
Look_Up_Variants. If the input word didn't match any word in the dictionary. 
Researcher looked up variants of the word. If it found one, the values of the variant 
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were added to the profile. This extension allowed a more compact dictionary and it 
increased the probability that most words would be found, but it complicated the 
problem described in the previous paragraph. In the case of Growing Dictionaiy and 
Concordance, a word like "looked" would have to be added to the CST for the sake of 
a concordance, but now it would also be added to the dictionar>\ Look_up_variants 
would thus eventually gum up the dictionarv' with a lot of useless variants. Solving the 
problem meant adding a third status. The three statuses were now : "Storable", which 
meant the node was a first class citizen, a w'ord which had values and should be stored 
to the dictionaiy" "Not_Storable", which meant that the node was a second class 
citizen, a variant word which had values, but should not be stored to the dictionary; 
and "Xot_Valuable", which meant the word was a third class citizen, a new word which 
had no values, nor should be stored to the dictionary. 

The solution of these problems completed the coding phase of the Reader and 
Researcher modules. The Reporter module remained a stub. The Stylist could not 
make recommendations and commendations to the user until it had established its 
dictionary and built up a history of other profiles. 

B. BUILDING THE DICTIONARY 

The CopyDix series and Style 1 through Style? had built up a dictionary' of some 
3000 words, which contained about 80% or 85% of the words of any given text. Each 
of these 3000 words, however, now required the assignation of five values ; etymology, 
tangibility, difficulty, emotional connotation and vigor. That totals to 15,000 values. 
.After experimenting, 1 developed a system that allowed me to enter those 15.000 values 
in two weeks. First, 1 sorted the dictionary alphabetically and printed out a listing. 
Using Funk and Wagnalfs Collegiate dictionary, 1 checked the etymology and 
highlighted the borrowed words on the print-out. Then I wTote a program called 
.AUTOM.ATE, which displayed a screen for each of the five values of each word and 
required one keystroke to assign a value. Using this system, 1 assigned values to all 
3000 words, going at a speed that relied upon a subjective reaction similar to the 
reaction of a reader scanning a text at a comfortable reading pace. Then I wrote two 
new programs, DIXSPLIT and DIXJOIN. DIXSPLIT split the dictionary into 17 
intersecting subsets : all native words, all borrowed words, all tangible words, all 
intangible words, and so on. Each of these subsets was like a formation of soldiers. 
Any word that didn't belong in that subset stuck out conspicuously. .After correcting 
these mistakes. 1 executed DIXJOIN, which brought the amended dictionary together. 

Now I was ready to begin testing The Stylist. 
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V. TESTING OF THE STYLIST 



A. INITITAL TEST DATA SET 

The initial test data set for The Stylist included both fiction and technical writing. 
Fiction was represented by long passages from two of my novels and 1000-word 
excerpts from the novels of Ernest Hemingway, Raymond Chandler and Kurt 
Vonnegut. Technical writing was represented by about a dozen student essays and 
excerpts from three textbooks on computer science. In both categories, I included 
both good and poor writing. I ran all the te.xts through StyleS, the Reporter module of 
which merely printed out the values of their profiles. StyleS performed robustly. I 
executed it under all four cases pertaining to the dictionary and concordance as well as 
under all three cases pertaining to the frequency of execution status reports. StyleS 
revealed no major flaws. Its shortcomings were well within the scope of the original 
design. (See Appendix A, Suggested Extensions to The Stylist). StyleS suggested some 
fine-tuning changes. 

Two changes dealt with the part of the CST scheme called the FourDix. The 
FourDix contains the most common function words, all of which are four letters or less 
in length. Examples of FourDix words are "the", "a", "then", "or", "he", "she", and so 
on. The FourDix is a separate CST. Words of the user's input te.xt of four or less 
letters in length are first sought for in the FourDbc. This is an economy measure, since 
a large portion of any text is made up of the function words found in the FourDix. 
Another savings of the FourDix is that the concordance is created by an inorder 
traversal of the main CST. This means that the functional words of the FourDix are 
included in the concordance only in the phrases embracing the more significant words. 
Other^^ise, the concordance would have dozens or perhaps hundreds of lines showing 
each use of words like "the". The first fine-tuning change suggested by StyleS dealt 
with the values of the words of the FourDix. These words had values, of course, but 
they tended to obscure the overall picture. Eliminating their values from the tally was 
the first solution. I modified this to adding only those values of personal pronouns, 
which are words of energetic vigor. This is similar to the approach of PC-Style. The 
second fine-tuning change was not adding the length of the FourDix words to the tally 
of word lengths. With the FourDix words, a bar graph of word length usually had two 
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humps : one tall one around three letters and another, shorter one near five letters. By 
eliminating the length of the FourDix words, both fiction and nonfiction writing always 
showed a bell-shaped curve. The bell of fiction writing tends to be a tall one centered 
on five letters/word; the bell of nonfiction writing tends to be a shorter one centered on 
six letters/word. All this means is that fiction writers use shorter words and words of 
more uniform length, whereas nonfiction writers use longer words and words of more 
variable length. As a stylistic measure, its use is limited. As a bar graph, it is a lot of 
fun. 

In any case, the profiles obtained allowed me to examine the true usefulness of 
The Stylist. Thankfully, The Stylist was able to distinguish between the novels of 
Ernest Hemingway and the ramblings of computer science graduates. The profiles of 
each text corresponded closely to my subjective impressions. (This is not surprising 
since the values of the dictionary were also the product of my subjective decisions. In 
this sense. The Stylist is an expert system that has automated my own rules and sense 
of style. The facility of allowing the dictionary to change, however, allows any user to 
adapt The Stylist to his own taste). 

I now could finish coding Reporter. Reporter now took the profile and 
manipulated its values to create an Analysis. This manipulation took into account the 
genre of the user's input text and involved simple weighting factors rather than any 
complex statistical methods. Like the original values of the words of the dictionary, 
these weighting factors depended upon my own subjective impressions, but they also 
derived from the results of the test data set runs. If a technical paper had four times as 
many intangible words than tangible words, it read like so much mush. So I 
incorporated a ratio of 4:1 as the limit of intangible:tangible for nonfiction. I tried to 
be conservative in these weightings, because I always envisioned The Stylist as a 
descriptive rather than a prescriptive tool. The Stylist thus rebukes the user only when 
the qualities of his text fall far outside the bounds of the norm. 

B. FIELD TESTS 

The Stylist was finally ready for field testing. This final phase was meant to 
mimic the introduction of the software product for consumer use. The field testing 
took place in two parts. In the first, two computer science graduates were briefed on 
the program and asked to use it. In the second, 22 papers for an Administrative 
Science composition course were run though The Stylist. 
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1. Field Test One : The Ape Test 

The two computer science graduate students evaluated the program by 
unleashing it on some of their own writing. Their overall reaction was highly positive, 
possibly because The Stylist seemed to approve of their writing. They praised the 
human factors and the graphics. They thought that the breakdowns of the values and 
the concordances were intriguing and offered them insights into their own writing. 
They complained that the report didn't explain the meanings of some of the values. 
They also didn't like the need to remove embedded commands from the input text. 1 
addressed these complaints by adding some explanations to the report and by changing 
Reader so that it didn't signal End_of_Sentence when it encountered an embedded 
command. 

2. Field Test Two : The Writing Class 

A Naval Postgraduate School Administrative Science composition class 
provided 22 short papers for testing. These students are almost all mihtary' officers 
who are considered by their services as top performers. As such, they constitute a 
somewhat literate and professional test group. Their fairly uniform papers were 
apparent attempts to incorporate some of the lessons of good business communication. 
I ran their papers through The Stylist, then provided the reports to the Professor, who 
gave them to the students. I never met the students; neither did they ever use The 
Stylist themselves. Of the 22 students, 12 fiUed out a questionnaire detailing their 
reactions to the reports. 

Most (10 out of 12) had never heard of a style checker before. Most (11 out 
of 12) understood the reports. One student thought The Stylist was worthless: 11 
found some of its features helpful. Six students found the measures of word length and 
sentence length helpful. Nine found the measures of vigor, etymology and difTiculty 
helpful. Six liked the concordance; four didn't think it was useful. Interestingly, only 
five thought The Styhst accurately reflected the qualities of their writing; three were 
sure it didn't; three just weren't sure. One common complaint was that they weren't 
sure of the meanings of some of the categories. 

Overall, this test underscored the need for a user's manual. Appendix B is just 
such a manual. If this manual had been available for the students or if they had had 
more experience using The Stylist, then they would have better understood the 
meaning of the reports. Despite this drawback, it was obvious from the comments in 
the questionnaire that the majority of the students hked The Stylist and would want to 
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use it or similar programs to analyze their writing. This test, therefore, indicated that 
The Stylist would be able to find a place in the classroom and possibly even the 
market. 



VI. CONCLUSIONS 



Researching, designing, coding and testing The Stylist was an excellent academic 
exercise. It brought home many of the lessons of software engineering. It sharpened 
my skills. The final product. The Stylist, seems a success in that its affective modeling 
works. As such. The Stylist should serve as a contribution to the development of other 
style checkers, rather than a stand-alone style checker itself As to whether writers, 
teachers and students should use computers to analyze writing, Tm reminded of the 
following passage from Plato's "Phaedrus": 

Socrates : 

At the Egyptian city of Naucratis, there was a famous old god, whose name 
was Theuth ... his great discovery was the use of letters. Now is those days the 
god Thamus was the king of the whole country of Egypt ... To him came 
Theuth and showed his inventions . . . when they came to letters, This, said 
Theuth, will make the Egyptians wiser and give them better memories; it is a 
specific both for the memory and for the wit. Thamus replied: O most ingenious 
Theuth, the parent or inventor of an art is not always the best judge of the utility 
or inutility of his own inventions to the users of them. And in this instance, you 
who are the father of letters, from the paternal love of your own children have 
been led to attribute to them a quality which they cannot have; for this 
discover}' of yours will create forgetfulness in the learners' souls, because they 
will not use their memories; they will trust to the external written characters and 
not remember of themselves. The specific which you have discovered is not an 
aid to memor}', but to reminiscence, and you give your disciples not truth, but 
only the semblance of truth; they will be hearers of many things and will have 
learned nothing; they will appear to be omniscient and will generally know 
nothing; they will be tiresome company, having the show of wisdom without the 
reality. (Ref 18] 

In the founh century before Christ, wise Athenians were debating the uses and 
evils of literature itself As Plato points out in the above passage toward the end of his 
dialogue, literature, the act of writing down our ideas, can have its pitfalls. It can 
weaken our memories. (Let anyone who has never forgotten where he parked his car 
argue this point.) It can also lead to intellectual cheating. The wisdom of a man 
capable of piecing together an article with the aid of a library and a long weekend may 
indeed be less than the wisdom of a man who can stand before a learned crowd and 
speak an intelligent discourse. 
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Nevertheless, civilization has embarked on a course inseparable from literature 
and WTiting. Wealth, power and knowledge has followed the progress of the witten 
word. Literature has made us strong, but to the extent that we rely upon books 
instead of our minds, we have grown weak. Now, at what we call the dawn of the 
information age, we would do well to remember the reservations of Plato. Electronic 
computation offers us dazzling abilities, but to the extent we rely upon it instead of our 
minds, we will grow w'eak. 

Therefore, any style checker or CACI program should be used as an interesting 
tool that provides a fresh perspective on writing. These programs cannot take and 
should not be put in the place of thoughtful readers, editors, teachers and friends. 
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APPENDIX A 

SUGGESTED EXTENSIONS TO THE STYLIST 



1. ABBREVIATIONS 

Reader's Read_Intext procedure ignores embedded commands such as 
".embedded" by keeping count of the number of words read since the end of the last 
sentence. If only one word has been read when another period is encountered, 
Read_Intext does not signal end_of_sentence. This simple feature also allows 
Read_Intext to ignore the ellipsis It doesn't always allow it to ignore 

abbreviations such as "Mr.", "Dr.", "etc.", or "e.g.". If these abbreviations appear as 
the first word of this sentence, Read_Intext doesn't signal end_of_sentence. If the 
abbreviation appears somewhere in the middle of the sentence, Read_Intext does signal 
end_of_sentence. An example is the following sentence : "You should have told Prof 
Wu about this earlier." Read_Intext would signal end_of_sentence at "Prof" and 
"earlier.". This is a shortfall of the Reader module, but one which I haven't corrected 
for four reasons: first, such occurrences are relatively rare; second, breaking the 
sentences into two always favors the user; third, the length of sentence measure is not 
the thrust of The StyUst; fourth, the solution would involve unwanted overhead in 
computation. 

Writer's Workbench solves this problem for at least 48 abbreviations. 
Presumably, every' time a period is encountered. Writer's Workbench checks a 
dictionary of these 48 abbreviations. If the word preceding the period matches one of 
these abbreviations, then Writer's Workbench does not count this as an end to a 
sentence. A similar solution could be implemented for The Stylist. 

2. TRANSLATION TO TURBO PASCAL 

The design of The Stylist kept in mind the working environment of a personal 
computer with 1.5 megabytes of main storage. A good follow-on project for The 
Stylist would be to translate the code from Waterloo Pascal to Turbo Pascal. Copies 
of this program could then be distributed and possibly even marketed. 

3. IMPROVEMENTS OF THE CONCORDANCE 

In addition to the main CST and the FourDLx, a third CST could be added. This 
new CST's node's key values would be characters of punctuation. During the 
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execution of Inorder_Concordance, the procedure that creates the concordance, the 
traversals "lastward" and "nextward" could terminate upon encountering a character of 
punctuation in this third CST. This would mean the lines of the concordance would 
only contain phrases extracted from single sentences. This may or may not be a 
desirable feature. I myself found it useful to read entire phrases, even when they 
overlapped into preceding or succeeding sentences. 

More sophisticated concordances include at the end of each line a note as to 
where this line can be found. The third CST outlined above could contains as a field in 
its main node just such a note, with the name of the user's input text, the page number 
and the line number. Such an extension would be valuable for scholarly research. For 
the short, single input texts, such notes are not necessary. 

4. PARTS OF SPEECH 

A considerably more ambitious extension would be to provide The Stylist the 
facility of determining the parts of speech. Other programs [Ref 8] [Ref 9] [Ref 10] 
solving this problem use thousands of rules. The disambiguation of parts of speech 
would raise The Stylist to a whole new order of complexity. It would also allow more 
fine analysis of the characteristics of the words. For example, "like" as a verb has 
much different connotations than "like" as a preposition. This facility would also 
allow other stylistic measures to figure into the profiles, such as, verb-adjective ratio 
and percentage of modifiers. 

5. PASSIVE VOICE 

The Researcher module could be extended to detect the use of passive voice. 
Ideally, this would be accomplished in the context of the extension involving parts of 
speech, so that active verbs would also be detected. A simpler solution is readily 
available, however. Researcher could raise a flag every time it receives a form of the 
verb, "to be": that is, "is", "are", "am", "was", "were", "being", and "been". This flag 
would remain raised for the next three words. If any of these three words ended in 
"-ed ", then Researcher would count this as an example of the use of the passive voice. 
.Again, I haven't implemented this solution because it is not central to the concept of 
affective modeling and because I didn't want to pay for the extra computation. 

6. CORRECTION OF POOR DICTION AND GRAMMAR 

.Another ambitious extension of The Stylist would be to provide it with the 
facility of correcting poor diction and grammar. Both RightWriter and DICTION of 
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Writer's Workbench contain dictionaries of commonly used cliches or repetitive 
redundancies. Some examples from DICTION are : "a great deal of', "in regards to", 
"make adjustments to" and so on. One solution for The Stylist would be to include a 
new field in the dictionary. Words like "deal", "regards " and "adjustments" would 
contain in this field a pointer to a table containing the poor phrases and their 
replacements. Such a solution would also require Researcher to maintain a phrase of 
the three or four most recently received words, so that it could verify that "regards" 
appeared in the offending context of "in regards to". 

Grammatical corrections would require an even more ambitious extension. To 
perform properly, such a facility would have to be an extension of the parts of speech 
extension. Determining agreement between subject and verb, for example, would 
require first the identification of the subject and the verb. Frankly, it would be easier 
to incorporate the affective modeling of The Stylist into a more extensive program such 
as Writer's Workbench than it would be to extend The Stylist this far. 
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APPENDIX B 



USER'S MANUAL 



1. INTRODUCTION 

The Stylist is a Waterloo Pascal program that analyzes the style of English prose, 
both fiction and nonfiction. The Stylist package contains the following : this User's 
manual, and the following electronic files ; the Waterloo Pascal code of The StyUst 
itself, the text file dictionary of The Stylist, and two Waterloo Pascal programs for 
helping to maintain the dictionary’, DIXSPLIT and DIXJOIN. 

The fundamental idea of The Stylist is that individual words have power. The 
denotation of a word is its meaning. The connotation of a word is its emotional 
impact. For example, "pupil", "student", and "scholar" all denote a person who studies 
and learns. The emotional connotations, however, range from the humble "pupil" 
through the famihar "student" to the lofty "scholar". Besides these emotional 
connotations, words have other intrinsic values which can be quantified. The Stylist 
concentrates on the following values: Etymology, Tangibility, Difficulty, Emotional 
Connotation and Vigor. 



a. Etymology 

One of the beauties of the English language is that its vocabulary embraces 
two main sources : native and borrowed. Our native words mainly come to us from 
the Anglo-Saxon tongues. Borrowed words come from Latin, mostly by way of the 
French of the Norman conquest. See Table 7, which illustrates the difference. 



TABLE 7 

EXA.MPLES OF ETY.MOLOGY 



Native Words 



Borrowed Words 



Man 



Finger 

Thinker 

Fire 



Person 

Digit 



Philosopher 

Conflagration 

Volume 

Miscreant 



Book 

Dirtbag 
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Native words tend to be short, strong and rough. Borrowed words tend to be 
long, mild and elegant. Good English prose, particularly good fiction, tends to use 
native words. Poor English prose, particularly bad technical writing, e.xhibits a 
tendency to overutilize latinate etymology. By counting up the number of native and 
borrowed words in your input text. The Stylist sees how it compares to good writing of 
the appropriate genre, 
b. Tangibility 

A word can either evoke an image of a thing or it can refer to an idea. The 
former words are "tangible" and the latter are "intangible". Table 8 illustrates the 
difference. 



TABLE 8 




EXAMPLES OF TANGIBILTY 


TANGIBLE 


INTANGIBLE 


Rock 


Ethereal 


Lips 


Automatically 


Beehive 


Then 


Corvette 


Rely 


Hammer 


Preliminary 



Tangible words are concrete, exact and evocative. Intangible words are ideal, 
general and cognitive. All writing uses both. Good writing, however, usually takes 
advantage of tangible words. Even the most philosophical writing benefits from the 
use of tangible words. Good fiction rarely strays too far toward the intangible. The 
Stylist counts the tangible and intangible words of your input text and compare this 
count to good writing of the appropriate genre, 
c. Difficulty 

This is one of the most indisputable characteristics of words. The Stylist 
defmes four levels of difficulty : Elementary', High Schdol, Graduate and Postgraduate. 
The words of each level are those most likely to be used in speech with ease by an 
average person of that educational level. Table 9 illustrates the categories. 

By counting the occurrences of difficult words. The Stylist could determine the 
overall difficulty of the vocabulary' and thus the readibility of your text. 
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TABLE 9 




1 


EXAMPLES OF DIFFICULTY 




ELEMENTARY 


HIGH SCHOOL 


GRADUATE 


POSTGRADUATE 


i Big 

Sister 

Spoon 

Flying 

Wish 

Done 

Handsome 


Tardy 

Rendezvous 

Transmission 

Foreman 

Process 

Joyous 

Undergo 


Matrix 

Immaterial 

Tonality 

Universal 

Induction 

Processor 

Linear 


Execrable 

Parsimony 

Recursive 

Homomorph 

Dilatory 

Zygote 

Synergy 



d. Emotional Connotations 

The Stylist defines five categories of emotional connotation : Sublime, 

Pleasant, Neutral, Unpleasant and Horrid. Table 10 illustrates the categories. 
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TABLE 10 








EXAMPLES OF EMOTIONAL CONNOTATION 




SUBLIME 


PLEASANT 


NEUTRAL 


UNPLEASANT 


HORRID 


Beauty 
1 Sunrise 
Victory 
! Love 
God 

Paradise 

1 

( 


Happy 

Food 

Friendly 

Warm 

Helpful 

Sex 


The 

Which 

Brick 

Is 

Name 

When 


Damage 

Loss 

Insulting 

Loser 

Cost 

Wound 


Cancer 

Murder 

Whore 

Fucks 

Death 

Traitor 



By counting the use of these types of words. The Stylist could determine the overall 
emotional tone of your passage. It could also detect flat or highly emotional writing, 
e. Vigor 

A related but distinct characteristic is vigor. Sublime and horrid words tend to 
be highly vigorous, but not all highly vigorous words are emotional. Examples of 
vigorous but unemotional words are "sprint", "rush" and "cross". This category also 
tends to be more objective. For example, the word, "soldier" can have widely different 
connotations for different people. The word "soldier" would please a career Army 
officer but it w'ould displease a survivor of the Japanese occupation of Canton. Both 
could agree that "solider" is a vigorous word. 



•I 4 

1 
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TABLE 11 








EXAMPLES OF VIGOR 




VIOLENT 


ENERGETIC 


CALM 


INERT 


Destroy 

Creation 

Fire 

Atomic 

Holocaust 

Conqueror 


Sprint 

Dive 

Discipline 

Wedding 

Steam 

Flying 


Read 

Write 

Manager 

Ocean 

Blue 

Sought 


From 

Into 

Something 

Comma 

Paper 

Format 



By calculating the vigor the words of your input text, The Stylist would be 
able to estimate its overall strength. 



2. USING THE STYLIST 

The first step to using the Stylist is to save your piece of writing as "Input text 
a". You should be aware of the following principles: 

1. Your input text should be at least 500 words long. The Stylist will examine 
shorter texts, but the statistical sample of shorter texts is too small for valid 
analysis. 

2. Your input te.xt should not be longer than 500 sentences (about 10.000 words). 
Such long texts require so much computation that most operating systems such 
as MVS will terminate execution before completion. 

3. The Stylist is not meant to analyze non-prose constructions such as tables, lists, 
references and bibliographies. You should eliminate these from your input text. 

In the multiprocessing environment, DEFINE MAIN STORAGE at 1500k. The 
VM/CMS command is "Define storage 1500k" followed by "I CMS". 

Once your input text is properly stored, execute The Stylist. DO NOT interrupt 
The Stylist during compilation or execution by hitting any keys. Always wait for The 
Stylist to display a screen with a prompt. After The Stylist is compiled, you will see 
just such a series of screens that will ask you for information. The following 
discussions will help you decide how to answer. 

a. Name of Intext 

The first screen will ask you for the name of your input text. Simply type in 
the name. You can use any characters. The maximum length of the name is 40 
characters. Hit "enter". 
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b. Fiction or NonFiction 

The screen will automatically clear. A new screen will ask you if your input 
te.xt is fiction or nonfiction. Enter "1" or "2". If you enter any other character, The 
Stylist will merely ask you again. 

c. Report Frequency 

The next screen asks whether you want reports "frequently", "seldomly" or 
"when done". A report is a screen which The Stylist displays to you during execution, 
telling you what it's doing, how much it's done and how much remains to be done. 
The act itsell' of producing such takes time : the more frequent the reports, the longer it 
takes. You should probably enter "2" for "Seldomly". Most users find this the most 
agreeable. 

d. Concordance 

The ne.xt screen asks whether or not you want a concordance. A concordance 
is an alphabetical listing of all the significant words of your input text. ("Significant" 
words are all words other than functional words such as "the", "and", "as", and so on.) 
Each line of the concordance contains the key word and the phrases preceding and 
succeeding it. The following is an example of a concordance. 

them and thamus enquired ABOUT their several uses and 

discovered is not an .MD to memorv’ but to 

other egyptians might be .ALLOWED to have the benefit 

an art is not .ALW.AYS the best judge of 

is called by them AMMON to him came theuth 

learned nothing they will .APPEAR to be omniscient and 

censured others as he .APPROVED or disapproved of them 

many arts such as ARITHMETIC and calculation and geometr>’ 

or inventor of an ART is not always the 

the inventor of many ARTS such as arithmetic and 

blame of the various .ARTS but when they came 

calculation and geometry and ASTRONOMY and draughts and dice 

have been led to ATTRIBUTE to them a quality 

in the learners' souls BECAUSE they will not use 

your ovTO children have BEEN led to attribute to 
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Most users find that a concordance offers them interesting insights into their 
use of words. The only drawback of producing a concordance is that it takes The 
Stylist additional time. If you elect this option, your concordance will be printed to a 
separate file called, "Concrdnc text a", 
e. Number of Words in Inte.xt 

The next screen asks you for the number of words in your text. If you don't 
know, make any guess. This number is only used in the screens displaying how far 
along The Stylist is in reading the te.xt. Your guess will have no effect on the analysis 
of the text. The Stylist counts the words itself and will report the e.xact number to you 
when it finishes. 

f Expand the Dictionary 

The next screen will ask whether you want to expand the dictionary of The 
Stylist or not. This dictionary is a listing of about 3200 of the most commonly used 
words, together with five values for each word. If you answer "1" for "Yes", the Stylist 
will ask you to give it a values for every word in you input text which is not found in 
the dictionary. It will then add these new words and their values to the dictionary. If 
you answer "2" for "No", then The Stylist will write all of the unknown words to a file 
called "N'ewwords text a". Reviewing "Neww'ords" will give you an idea of the words 
that you use which aren't in the dictionary. 

The first few times that you use The Stylist, you probably should answer "2" 
for "No". Entering the values for the new words can be tedious. You probably want 
to see how The Stylist works before you begin to expand or modify the dictionary. 
After you've used The Stylist for a while, you'll probably want to begin adding some 
words to the dictionary. Words peculiar to your field of writing or the jargon of your 
profession are examples of the types of words which should be entered. Adding them 
will tailor The Stylist to your working environment. Keep the following in mind : 

*** Every word added to the dictionary is another word 
that will have to be loaded into memory during each 
execution of The Stylist. .Adding many words will slow' 
dow'n execution and it will decrease the amount of space 
in main storage. If the dictionary grows beyond 5000 
w'ords, you should weed out words that are rarely used 
in your type of writing. To do this, simply delete 
that line from the file "Dixonary text a". *** 
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If you answer "I" for "Yes", as the Stylist encounters each new word in your 
text, it will ask if you want to add this particular word to the dictionary. If the word is 
a misspelling or if it is a word that you rarely use, don't add it. Also, you should know 
that its best to add the root form of words. For example, neither "burns" nor "burned" 
nor "burning" should be added. It's much better to add "burn". To keep the 
dictionary' as compact as possible, note down the root form of words you wish to add 
to the dictionary and append them to the end of the next text you run through The 
Stylist. 

The Stylist will then ask you for the values of the words you do want to add 
to the dictionary. The following will help you decide how to answer these prompts : 

1. Source 

If you're uncertain about the etymology of the word, simply look it up in a 
collegiate dictionary'. Old English, .Middle English, Norse, German, Old German, 
.Middle German and all Celtic languages are considered "Germanic". Latin, French, 
Spanish, and all other languages are considered "Latin". 

2. Difficulty 

Each category contains those words which are most likely to be used in 
speech by persons of that educational level. If you're still uncertain of the meaning of 
these categories, the following are examples of the categories : 

POSTGFLADU.ATE DIFFICULTY 

aggregate 

algorithm 

algorithmic 

ambiguity 

analogue 

applicative 

conceptualize 

conglomerate 

constriction 

consummate 

consummation 

convoluted 

convolution 
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culminate 

cyclical 

digitize 

GR.ADUATE DIFFICULTY 

accumulate 

acrid 

acute 

adhere 

adjoin 

allocate 

allude 

analog^’ 

analysis 

analyst 

ancestor 

ancestral 

annihilate 

apparatus 

appendage 

arc 

arch 

array 

HIGH SCHOOL DIFFICULTY 

abandon 

ability 

able 

abrupt 

absence 

absent 

absorb 

acceptable 

access 
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accommodate 

accomplish 

accomplishment 

account 

accurate 

achievement 

activate 

adapt 

additional 

adjust 

advantage 

ELEMENTARY DIFFICULTY 

about 

above 

accept 

acceptance 

ache 

across 

act 

action 

active 

activity 

actor 

add 

addition 

address 

admit 

advance 

adventure 

afraid 

after 

afternoon 

again 



i. Concreteness 

Does this word name a thing that you can touch? If it does, then answer 
"1" for "Tangible". Otherwise, answer "2". 

4. Emotional Connotation 

What feelings does this w'ord evoke in you? If none, answ^er "3" for neutral. 
Otherwise, answer in the appropriate category. 

5. Vigor 

Vigorous words are not necessarily verbs. Words like "conquest" are highly 
vigorous. "Violent" vigor means "highly" vigorous. Some of the w'ords in the "violent" 
category are "creation", "triumph" and "epiphany". This category, as does emotional 
connotation, calls for your subjective reaction. Don't be afraid to "guess" which value 
is appropriate. Such a "guess" is in fact your subjective reaction, 
g. The End of Execution 

The above are all the screens which prompt you for answers. All the rest of 
the screens will merely inform you about the progress of the execution. When 
execution ends, you should look into the following files : "Newwords te.xt a", if you 
decided not to let the dictionary expand; "Concrdnc text a", if you elected to have a 
concordance produced; and most imponantly, "Report text a", w^hich is The Stylist's 
report on your text. 

3. THE MEANING OF YOUR REPORT 
a. Measures of Length 

The first two pages of your report deals with the number of w'ords. the length 
of words and the length of sentences. These measures are always interesting but not 
always illuminating. The first bar graph shows the distribution of the length of the 
words of your text. Each horizontal column represents the percentage of the words of 
your next which were of so many letters of length. The overall bar graph should look 
like a bell-shaped curve. This bar graph does not include function words such as "a", 
"an", "the", "then" and "or". The blurb below the bar graph explains how your text 
compares to other texts of the same genre. The only clear indication of prose in 
trouble would be a dramatic slewing of the curve to the right. This would mean that 
you're using too many windy w^ords. Table 12 breaks down the characterizations of 
word length. 



47 



TABLE 12 
WORD LENGTH 



Average Word Length 
( lower boundary) 

Characterization Fiction Nonficrion 



Too Long 
Long 
Medium 
Short 
Too Short 





The second bar graph shows the length of the sentences of your input text. 
Each bar represents the length of a sentence. You can use this graph to identify run- 
on sentences. Overall, this bar graph should resemble the skyline of a modest city such 
as San Francisco. If it resembles New York, then you tend to write long sentences. 
Table 13 explains the characterizations of sentence length. 





TABLE 13 


SENTENCE LENGTH 


i Characterization 


Lower boundary of length 


j 

Too Long 


22 


Long 


18 


Medium 


15 


Short 


10 


Too Short 

1 

I 


0 



One of the measures below this bar graph is "Number of sweet spots". This is 
the number of sentences ranging between 9 and 19 words. The idea here is that most 
ideas can find a home in a sentence of this length. Table 14 breaks down how The 
Stylist characterizes the modulation of your sentences. 

Another count here is the number of run-on sentences. The Stylist defines 
anv sentence over 45 words in length as a run-on. The Stylist realizes that manv grand 
sentences over 45 words in length are not run-ons, but it's willing to bet that most 
overlong sentences are. 
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TABLE 14 
MODULATION 



Percentage 

Above 50% 
20% to 50% 
Befow 20% 



Characterization 

Good 

Average 

Poor 



b. Measures of Word Characteristics 
7. Etymology 

The next item in the report is the number of Latinate and Germanic words. 
Notice that these two numbers will not add up to the total number of words in the text 
unless ever>' word in the text happened to belong to the dictionary. The 
characterization of your text in this category depends on the ratio of latin to germanic 
words. Good writing generally tends to be native. Table 15 breaks down these 
characterizations. 



TABLE 15 

CHARACTERIZATIONS OF ETYMOLOGY 




Ratio of 


Latin to German 


Characterization 


Fiction 


NonFiction 


Very Borrowed 


2: 1 


3: 1 


Borrowed 


1. 5: 1 


2: 1 


Mixed 


1: 1 


1: 1 


Native 


. 5: 1 


. 9: 1 


Very Native 


<. 5: 1 


<. 9: 1 



2. Difficulty 

The next entry show’s the tally of the numbers of w'ords of the various 
levels of difficulty, followed by the percentage of those levels. The difllculty of the 
text's vocabulary should be appropriate for its intended audience. Moreover, you 
should remember than many outstanding works of literature convey difiicult ideas 
w’ithout resorting to difficult words. Table 16 describes the characterizations of this 
category. 
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TABLE 16 

CHARACTERIZATIONS OF DIFFICULTY 



Characterization 

Very Hard 
Hard 

Challenging 

Easy 



Cause 

%postgrad > 0 
^rad > 5 

%highschool > 10 
above percentages 
do not pertain 



3. Tangibility 

The next entries show the total of the tangible and intangible words. As in 
the etymology categoiy, the characterizations depend upon the ratio between these 
words and the genre. Generally, good WTiting is tangible. 



I 

1 TABLE 17 

CHAR-ACTERIZATIONS OF TANGIBILITY 


Characterization 


Ratio of Intangible to Tangible 
Fiction Nonfiction 


Very tangible 


>1.5:1 >2:1 


Tangible 


1. 5:1 2:1 


1 Very intangible 

1 


3:1 4:1 



4. Emotional Connotation 

The next breakdown is of the number and percentages of the words 
belonging to the various categories of emotional connotation. Following these values 
is something called the "Index of Emotion". The following is the formula used to 
derive this index : 

Index = (Percent Horrid times 5) + 

(Percent Unpleasant times 2) + 

(Percent Pleasant times 2) + 

(Percent Sublime limes 5) 
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The Index of Emotion is simply a number that conveys a sense of the use 
of strongly emotional words. The most emotional words, Horrid and Sublime, are 
weighted the most heavily. This Index allows The Stylist to compare your use of 
emotional words to the use found in other texts. Table 18 shows the thresholds of 
such uses. 



TABLE IS 

CHARACTERIZATIONS OF EMOTIONALITY 

Characterization Lower Boundry of Index 

Rich 20 

Average 10 

Poor 0 



Following this characterization is another one, this time for the overall tone 
of your text. The Stylist compares the frequency of use of positive words to the 
frequency of use of negative words. If positive words prevail, the tone is positive. If 
negative words prevail, the tone is negative. If the two balance and the index of 
emotion is high, then the tone is characterized as a balance of strong positive and 
strong negative emotions. If the two balance and the index of emotion is low, then the 
tone is characterized as bland. 

5. Vigor 

The final breakdown is that of the number and percentage of words of the 
various categories of vigor. Following this is an "Index of Strength", which is a 
measure similar to the Index of Emotion. The following is the formula used to derive 
the Index of Strength : 

Index of Strength = (Percent of violent words times 10) + 

(Percent of energetic words times 5) + 

(Percent of calm words) 

This Index allows the Stylist to compare your use of vigorous words to uses 
found in other texts. Table 19 explains how The Stylist uses the index to characterize 
your text. .Most good writing is strong or lively. Usually only action-packed fiction 
reaches the upper registers of "very strong". 
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TABLE 19 

CHAR-ACTERIZATIOXS OF VIGOR 



Characterization 



Lower Boundry of Index 



Very Strong 
Strong 
Lively 
Weak 



60 

50 

20 

0 



6. Recommendations 

The final section of your report includes recommendations and 



commendations. The Stylist is meant to be a descriptive rather than a prescriptive 
tool. Therefore, it makes recommendations only when some aspect of your writing 
seems well outside of the bounds of the normal. For this reason, you should consider 
the recommendations of The Stylist. They apply only to what appear to be extreme 
cases. On the other hand, you should never accept these recommendations as the 
pronouncements of some oracle. The Stylist is merely a machine. When in doubt, 
trust your own instincts. 

4. MAINTENANCE OF THE STYLIST 

a. Care of the Dictionarv’ 

Many of the issues regarding the care of the dictionary have already been 
covered. Here are some other helpful pointers : 

1. Always maintain more than one copy of the dictionary. Loss of the dictionary’ 
renders The Stylist useless. 

2. Beware of changing the values directly. You should never do this to the 
dictionary itself While using DIXSPLIT, it is necessary to go into the split 
files and change the codes for the words which don't belong in that split file. If 
you make a mistake, DIXJOIN will merely assign the most common value for 
that category. 

b. Changing the Code of The Stylist 

The Stylist comes with embedded documentation. .Make sure that you read 
this documentation prior to changing any of its code. You should read the full 
master's thesis pertaining to The Stylist prior to making any substantive changes.^ 



^See .Vlaster's Thesis : The Stylist : A Pascal Program for Analyzing Prose Style, 
by Lt. Thomas C. Cool, USX, Naval Postgraduate School, Monterey, California. 1987. 







TABLE 20 
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CODES OF THE DICTIONARY 


1 

1 






COLUMN 






First 


Second 


Third 


Fourth 


Fifth 


Etymology 


Tangibility 


Difficulty 


Emotion 


Vigor 


1( atin) 
g( erman) 


t( angible) 
i( ntang. ) 


e( lement. ) 
h( igh sc. ) 
g rad) 
p( ost gr. ) 


h( orrid] 
u( nple. 
n( eutr. 
p( leas. 
s( ubli. ] 


1 i(nert) 

1 c( aim) 

1 e( nerg. ) 

1 v( iole. ) 

i 



Substantive changes should also bear in mind the modularization of the 
design. Reader module bears full responsibility for reading the user's input text. It 
passes along words (in lower case) and their length. It also signals the ends of 
sentences and the end of the file. Researcher Module bears full responsibility for 
maintaining and using the dictionary, for building up the profile of the user's input 
text, and for building the concordance. Finally, Reporter module accepts the profile, 
creates the analysis and writes out the report. 

Some fine tuning of The Stylist is readily available. The global constants give 
you the capability to adapt The Stylist to your operating environment. For example. 
"Maxsent" allows you to determine the maximum number of sentences in the input 
text. Moreover, by manipulating the weighting factors in the Calculate procedures of 
the Reporter module, you can adapt The Stylist to the norms of a particular school of 
writing, such as submissions to magazines as varied as "Fantasy and Science Fiction" 
and "Foreign .Affairs." 
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APPENDIX C 

THE CODE OF THE STYLIST 



(X NOTE : THE FOLLOWING IS NOT IN EXECUTABLE ORDER. 

IT IS IN THE ORDER OF GREATEST CONCEPTUAL CLARITY, x) 



(3($S60000X) 



(X THE STYLIST x) 

( X X) 

(X a Pascal Program for Analyzing Prose Style x) 

(X by Lieutenant Thomas C. Cool^ USN X) 

(X Submitted in partial fulfillment of the requirements X) 

(X for the degree of X) 

(X MASTER OF SCIENCE IN COMPUTER SCIENCE x) 

(X from the X) 

(X NAVAL POSTGRADUATE SCHOOL X) 

(X June 1987 x) 

(X X) 



program Stylist (input, output); 
const 

maxsent = 500; (x limit of number of sentences in intext X) 

v^vordlength = 15; 
linelength = AO; 

phraselength = A; (X length of phrase around condcordance linex) 



freqnum = 50; (X the three numbers determine how often X) 
selnum = 200; (X Stylist reports to user during executioniX) 
whendonenum = 1000000; (X frequently, seldomly or when done X) 



type 

(X GLOBAL TYPES x) 



Genretype = (Nonfiction, Fiction); 

Frequencytype = (Frequently, Seldomly, When_Done); 

Wordtype = packed array ( . 1 . . wordl ength . ) of char; 

Linetype = packed array (. 1 .. 1 i nel ength . ) of char; 

User_Inf otype = record 

Name_of_text : linetype; 

Genre : Genretype; 

Size : Integer; (x users guess of intext length x) 

Frequency : Frequencytype; (X how often user wants reports X) 

Want_Concord : boolean; 

Want_Dix_Grow : boolean; 
end; 



(X READER MODULE TYPES X) 

Readtype = record (X created by Reader, passed to Researcher x) 

Word : wordtype; 

Length : integer; 
end; 
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RESEARCHER MODULE TYPES 









Balancetype = (Plus^ Zero, Minus); Balance of Node in BST JO 

Statustype = (Storable^ Not_Storabl e^ Not_Va luable) ; 

(J( Storable - a full node with all values which 
• (J^ can be added to the dictionary 

(J^ Not_Storable - a node of a variant (-ing, -ed) 

(J^ word which has values but should J^) 

(J( not be added to dictionary J^) 

(J^ Not_Valuable - a node of a new word without 

(J^ values. Cannot be Add_Valued and J^) 

(J^ cannot be added to dictionary. 

Sourcetype = (Latinate, Germanic); 

Dif f i cul tytype = (PostGrad, Grad, High_School, Elementary); 
Concretenesstype = (Tangible, Intangible); 

Emotiontype = (Sublime, Pleasant, Neutral, Unpleasant, Horrid); 
Vigortype = (Violent, Energetic, Calm, Inert); 

(J^ tags which describe each word in the dictionary J^) 

EntryPointer = -Entrytype; 

ConcordPointer = ^Concordtype; 

Entrytype = record (J^ main node in the BST JO 

Nord : Wordtype; ( J^ word is key field Jf) 

Balance : Balancetype; 

Status : Statustype; 

Source : Sourcetype; 

Difficulty : Di ffi cul tytype; 

Concreteness : Concretenesstype; 

Emotion : Emotiontype; 

Vigor : Vigortype; 

Left, Right : EntryPointer ; 

Down : Concordpointer; 
end; 

Concordtype = record ( J^ a Concord node is a cluster of pointers J^) 

Up : EntryPointer ; ( J^ which create linked lists from the firstJ^) 

Down, ( J^ word of the Intext to the last, and fromJ^) 

Next, (J^ the last to the first. Used only to J^) 

Last : ConcordPointer ; (J^ create the concordance. J^) 

end; 
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L trsPerWordtype = array ( . 0 . . wordlength . ) of integer; 
array of total number of words of each length 

WordsPerSenttype = array ( . 0 . . maxsent . ) of integer; 
array of the length of each sentence 



Profiletype = record 



the profile is the main product 
of the Researcher Module. Here x) 
Researcher keeps track of all 
counts. 



Totalwords : integer; 
Totalletters : integer; 
Totalsentences : integer; 
TotalStrucl trs : integer; 
NumWordsThisSent : integer; 
Newwords : integer; 

KnownWords : integer; 
VariantNords ; integer; 

St ructurewords : integer; 
UnknownWords ; integer; 

Numfound : integer; 

LtrsPerWord : 1 trsperwordtype; 
WordsPerSent : wordspersenttype; 
Ave_l trs_per_word : real; 
Ave_wrds_per_sent : real; 



Total number of words in intext X) 



Total of Itrs of structural wrds 
# words in sentence being read 
words user adds to dixonary X) 
words of intext found in dixon.X) 
variant words of intxt in dixon^) 
stucture words of intxt in Adix^) 
intext words not found anywhere^) 
total words minus unknown words^O 



rest of profile are counts of 
words of the intext# found in x) 
(X the dictionaries, which have 
these corresponding values 



NumLatinate : integer; 

NumGermanic : integer; 

NumPostGrad : integer; 

NumGrad : integer; 

NumHi gh_School : integer; 

NumEl ementary : integer; 

NumTangible : integer; 

NumI ntangibl e ; integer; 

NumSublime : integer; 

KumPleasant : integer; 

NumNeutral : integer; 

NumUnpl easant : integer; 

NumHorrid : integer; 

NumViolent : integer; 

NumEnergetic ; integer; 

NumCalm : integer; 

Numlnert : integer; 
end; 

Phrasetype = packed array ( . 1 . . phrasel ength . ) of wordtype; 

holds the phrases bracketing the key words of concordance line 
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(X 



REPORTER MODULE TYPES 



X) 



Lengthtype = (TooShort, Shorts Medium, Long, TooLong); 
Etymologytype = (TooBorrowed, Borrowed, Mixed, Native, TooNative); 
Modulationtype = (Bad, Average, Good); 

Runonstype = (Unacceptable, Acceptable, Nonexistent); 

Tangibili tytype = (Soft, firm. Solid); 

Strengthtype = (Weak, Lively, Strong, VeryStrong); 

HardWordstype = (Easy, Challenging, Hard, VeryHard); 
Emotionalitytype = (Poor, Standard, Rich); 

Tonetype = (Negative, Bland, Positive); 

(X Judgements of intext style Reporter makes based on profile ^) 

Analysistype = record 

WordLength : Lengthtype; 

SentLength : Lengthtype; 

Modulation : Modulationtype; 

Runons : Runonstype; 

Etymology : Etymologytype; 

Tangibility : Tangibili tytype; 

Strength : Strengthtype; 

Hardwords : HardWordstype; 

Emotionality : Emotionalitytype; 

Tone : tone^pe; 
end; 
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(X 



GLOBAL VARIABLES 



X) 



van 

Info : User_Ihf otype; 

Allswell : boolean; global flag that execution can continue 

(X used instead of a GOTO end of program X) 



FourRoot / 

Root : EntryPointer ; 
Lastward : ConcordPoi nter ; 



Root of the FourDix BST 
Root of the Dixonary BST 
Pointer to last word read 



Intext, 
Dixonary, 
FourDix, 
Concrdnc, 
NewHor ds , 
Report : text; 



file that holds users prose 
dictionary of words and values 
diction, of short, functional words X) 
(X concordance, created by researcher X) 
list of words not found or added 
(X main product of Reporter 



Tally, Researcher scratch pad of profile 50 

Profile : prof i letype; (5( Researcher input to Reporter 50 

Analysis s analysistype; (5^ Judgements made by reporter 5() 



(5( MAIN PROGRAM 5^) 

begin ( 5( main program 5^) 

Interroga te_User ; 
if Allswell then Init_Reader; 

if Allswell then Ini t_Resea rcher ; 

if Allswell then Ini t_Reporter ; 

if Allswell then Read_Intext; 

If Allswell then Calculate^Prof ile; 

If Allswell then Ana lyze_Prof i 1 e_and_Report ; 

If Allswell then if Inf o . Want_Concord then 

Wri te^Concordance; 

If Allswell then if Inf o . Want_Dix_Grow then 
Store_New_Dix; 

If not Allswell then 

Writeln (’Execution terminated until you clear that up.’); 

end . 
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procedure Interroga te_User ; 
var c : char; 

good_integer : boolean; 
i, j : integer; 
begin 
Page; 

Writeln; Writeln; Writeln; 

Writeln CTHE STYLIST ^9 ) ; Writeln; Writeln (*by LT TC Cool’:50); 
Writeln; Writeln; Writeln; Writeln; 

Writeln C ’What is the name of the text to be analyzed? *: 65 ) ; 
i : = 0; 

while (i < linelength) and not (EOLN) do begin 
i : = i + 1 ; 

Read ( Inf o . Name_of_Text ( . i . ) ) ; 
end; Readln; 

for j : = i + 1 to linelength do Inf o . Name_of_Text ( . j . ) := * *; 
Page; 

Repeat 

Writeln; Writeln; Writeln; 

Write (*Is S Inf o . Name_of_Text ) ; 

Writeln (’ saved as "Intext text a"? ’); Writeln ; 

Writeln C* 1) Yes *:^5); 

Writeln ( * 2) No ' :45); 

Read (c); Readln; Writeln Cc:45); 

Until (c= '1') or (c = *2'); 

if (c = ’!*) then begin Writeln (’That is good.’:^5); 

Allswell := true; end 

else begin Writeln (’That is too bad.’: A3); 

Allswell := false; end; 

if Allswell then begin 
Repeat 

Page; Writeln; Writeln; Writeln; 

Writeln ( Inf o . Name_of_text : 55) ; Writeln; 

Writeln ( ’ is ’ : A5); 

Writeln; Writeln; 

Writeln (’1) Fiction ’:A5); 

Writeln (’2) Nonf iction ’ : A5) ; 

Writeln; Writeln; 

Read (c); Readln; Wri teln (c:A5); 

if (c = ’1’) then Info. Genre := Fiction 

else if (c = ’2’) then Info. Genre := Nonfiction; 

Until (c = ’!’) or (c = ’2’); 

Repeat 

Page; Writeln; Writeln; Writeln; 

Writeln (’ REPORT ’:A5); Writeln; 

Writeln (’1) Frequently ’: A5 ) ; 

Writeln (’2) Seldomly ’:A5); 

Writeln (’3) When Done ’:A5); Writeln; 

Read (c); Readln; Writeln; Writeln (c); Writeln; 
if (c = ’1’) then Inf o . Frequency := Frequently 
else if (c = ’2’) then Inf o . Frequency := Seldomly 

else if (c = ’3’) then Inf o . Frequency ;= When_Done; 

Until (c = ’!’) or (c = ’2’) or (c = ’3’); 
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Repeat 

Page; Wri tel n; Wri teln ; Wri tel n ; 

Writeln (' CONCORDANCE Writeln; 

Writeln (* 1) Yes *;A5); 

Writeln ( * 2) No * ;^5); 

Read (c); Readln; Writeln (c:A5); 

if (c = •!*) then Inf o . Want_Concord := true 

else if (c = *2’) then Inf o . Want_Concord := false; 

Until (c= *1*) or (c = *2*); 

Repeat 

Page; Writeln; Writeln; Writeln; 

Writeln ( ’ Approxima tely how many words are in •); 

Writeln ( I nf o . Name_of_text , ’ ?•*. 3); 

Info .size : = 0 ; 

Good_integer := true; 
while not EOLN do begin 
Read (c); 

if (c in ( , * 0 • . , * 9 * . ) ) then 

Info. size := Info. size ^ 10 + ord(c) - ord(*0*) 
else begin good_integer ;= false; 

Writeln (c:2/ • is an invalid character . ’ ) ; 

end; Readln; 

Until Good_Integer ; 

Writeln (Info.size:45); 

Repeat 

Page; Writeln; Writeln; Writeln; 

Writeln (’ EXPAND DICTIONARY ? «:95); Writeln; Writeln; 
Writeln ( ’ 1) Yes ’ : A5); 

Writeln ( • 2) No ’ -.45); 

Read (c); Readln; Writeln (c:A5); 

if (c = ’1*) then Inf o . Want_Dix_Grow := true 

else if (c = *2*) then Info . Want_Dix_Grow: =false; 

Until (c = '!') or (c = *2’); 
end; if allswell 
end ; 



end 
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procedure Init_Reader; 
begin 

Reset (Intext/ * Intext text a’); 
end; 



procedure Ini t_Researcher ; 
var 
i ^ 

stepcount / 
threshhold^ 
dixlength : integer; 
entry : entrytype; 
balanced : boolean; 
begin 

Reset (Dixonary^ ’Dixonary text a’); 

Rewri te( Newwords^ ’Newwords text a’); 

if Inf o . Want_Concord then Rewrite (Concrdnc/ ’Concrdnc text a’); 
if ( Inf o . frequency <> When^Done) then begin 
Dixlength *.= 0; 

while not EOF (Dixonary) do begin 

Readln (Dixonary); Dixlength ;= dixlength + 1; 
end; 

Reset (Dixonary/ ’Dixonary text a’); 
end; 

case Inf 0 . Frequency of 

Frequently : threshhold j= freqnum; 

Seldomly : threshhold := selnum; 

When_Done ; threshhold := WhenDonenum; 

end; (^ cases x) 

New (Root); 

Load ( 2 , Root^); 

Root-. left := nil; Root-. right := nil; 

Root-. down := nil; 

Root- . balance := Zero; 

Root-. status := Storable; 

Lastward := nil; 

Stepcount := 0; 

while not EOF (Dixonary) do begin 
Load ( 2 , Entry); 

Entry . balance := Zero; balanced := false; 

AVL_Insert (Entry, Root, balanced); 
if ( Inf o . frequency <> When_Done) then begin 
stepcount := stepcount + 1; 

if (stepcount mod threshhold = 0) then begin 
Page; Writeln; Writeln; Writeln; Writeln; 

Writeln (’Stylist is now loading its dictionary. ’:50); 
Writeln; Writeln; 

Writeln (’ Last word loaded was ’;50); 

Writeln ( entry . word ; 40 ) ; Writeln; Writeln; 

Writeln (’Stylist has loaded ’: 30 , stepcount : 5 , ’ entries.’); 
Writeln (’out of a dictionary of ’ : 33 , Dixl ength •. 5 , ’ words.’) 
Writeln; Writeln; 

Writeln ( Stepcount^l 00 div Dixlength : 33, ’ % complete.’); 

end; if X) 
end; if 

end; (^ while not EOF 
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with tally do begin 
totalletters := 0; 
totalwords := 0; 
totalsentences := 0; 
totalstrucltrs := 0; 
numwordsthissent := 0> 

for i : = 0 to wordlength do 1 trsperwordC . i . ) := 0; 
for i := 0 to maxsent do wordspersent ( . i . ) := 0; 

KnownWords i= 0; 

NewWords := 0; 

Numfound := 0; 

VariantWords := 0; 

Structurewords := 0; 

UnknownWords := 0; 

NumLatinate := 0; 

NumGermanic :=0; 

NumPostGrad : =0; 

NunGrad : = 0 ; 

NumHigh_School := 0; 

NumEl ementa ry := 0; 

Numlangible := 0; 

Numlntangibl e := 0; 

NumSubl ime : = 0 ; 

NumPleasant := 0; 

NumNeutral ;= 0; 

NumUnpleasant := 0; 

NumHor ri d : = 0 ; 

NumViolent := 0; 

NumEnergetic := 0; 

NumCa Im : = 0 ; 

NumI nert : = 0 ; 
end; with tally do 

Reset (FourDix^ ^FourDix text a*); 

New (FourRoot); FourRoot-* . ri ght := nil; FourRoot-' . lef t ;= nil; 
Load (1, FourRoot-); 

Attach (FourRoot-, FourRoot); 

FourRoot- . ba lance := Zero; 

While not EOF (FourDix) do begin 
Load (1, Entry); 

Entry . balance := Zero; 
balanced := false; 

AVL_Insert (Entry, FourRoot, balanced); 
end; 
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procedure Load (filenum : integer; var entry : entrytype); 
var 

sourcel tr^ diff icul tyltr/ 
concreteness! tr/ emotion! tr^ 
vigor!tr ; char; 
begin 

if (fiienum = !) then Read!n (FourDix^ Entry. word, 

source! tr, dif ficu! ty! tr , 
concreteness!tr , emotion!tr, 
vigor! tr) 

else Read!n (Dixonary, Entry .word, sourceitr, dif f icu! ty! tr , 

concreteness! tr, emotion!tr, 
vigor !tr ) ; 

with entry do begin 

if (sourceitr = *!') then source := Latinate 

e!se source j= Germanic; 

case dif ficu! ty! tr of 

*p* : difficuity := Postgrad; 

*g* : difficuity ••= Grad; 

*h' : difficuity := High_Schooi; 

*e* ; difficuity := Eiementary; 

end; 

if ( concreteness! tr= 't’) then concreteness := Tangibie 

eise concreteness := Intangibie; 

case emotionitr of 



's' : 


emotion : 


= Subiime; 


'p» : 


emotion : 


= Pieasant; 


' n ' : 


emotion : 


= Meutrai; 


'u' : 


emotion : 


= Unpieasant; 


'h' : 


emotion : 


= Horrid; 


end; 

case vigoritr of 
' V * i vigor : = 


Vioient ; 


I e' : 


vigor : = 


Energetic; 


'C' : 


vigor := 


Cairn; 


'i' : 


vigor := 


Inert; 


end; 

; with entry do 


X) 
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procedure AVL_Insert (Entry 



entrytype; var p : Ent rypointer ; 
var balanced j boolean); 



var 

pl^ p2 : entrypointer; 
begin 

if (p = nil) then begin 
Attach (Entry, p); 

P-. balance := Zero; 
balanced i= true; 

end 

else if (Entry. word < p^.word) then begin 

AVL_Insert (Entry, p^.left, balanced); 

If Balanced then (^ left pointer has grown higher ^) 
case p^. balance of 

Plus : begin p^. balance ;= Zero; balanced := false; end; 

Zero : p^. balance ;= Minus; 

Minus : begin (^ rebalance ^) 
pi := p-.left; 

if (pi-. balance = Minus) then begin(^ single LL rotate) 
P-.left := pi-. right; 
pl-. right := p; 

P-. balance ;= Zero; 

P : = pl ; 

end (^ if ^) 

else begin (^ double LR rotation ^) 
p2 := pl-. right; 
pl-. right := p2-,left; 
p2- . left s = pl ; 
p-.left := p2-. right; 
p2-. right := p; 

if (p2-. balance = Minus) then p-. balance := Plus 
else P-. balance := Zero; 

if (p2-. balance = Plus) then pl-. balance := Minus 
else pl-. balance := Zero; 

P := p2; 
end; (^ else ^) 

P-. balance := Zero; balanced := false; 
end; (^ case of Minus ^) 
end; (^ of cases ^) 
end (^ if Entry. word < p-.word ^) 
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else if C Entry. word > p-.word) then begin 

AVL_Insert (Entry/ p-. right/ balanced); 
if balanced then right pointer has grown higher x) 
case P-. balance of 

Minus : begin p-. balance := Zero; 

balanced := false; end; 

Zero : p^. balance := Plus; 

Plus : begin (^ rebalance 
pi := P-. right; 

if (pl^. balance = Plus) then begin 
(X single RR 
P^. right := pl^.left; 
pi-. left := p; 

P-. balance := Zero; 

P : = pi ; 
end if ^) 

else begin double RL rotation ^) 
p2 := pl-.left; 
pl-.left := p2-. right; 
p2-. right := pi; 

P-. right := p2-.left; 
p2-.left := p; 

if (p2-. balance = Plus) then 
P-. balance := Minus 
else P-. balance ••= Zero; 
if (p2-. balance = Minus) then 
pi-. balance := Plus 
else pi-. balance s= Zero; 

p := p2; 

end; double RL rotation 
P-. balance := Zero; balanced := false; 
end; case of balance = Plus X) 
end; (x of cases x) 
end (^ of if Entry. word > p-.word 
else balanced := false; 
end; of procedure AVL_Insert ^) 

procedure Ini t_Reporter ; 
begin 

Rewrite (Report/ ’Report text a’); 

Writeln (Report); Writeln(Report); Writeln( Report ) ; 

Writeln (Report/ Inf o . Name_of_Text : ^5) ; 

Writeln (Report/ ’ PROFI L E ’ : 45 ) ; 

Writeln (Report); Writeln( Report ) ; 

end; 
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procedure Read_Intext; 
va r 
i / 

letternum; 
wordnum, 
threshhold, 
stepcount / 
offset : integer; 
c : char; 
n : readtype; 

begin 

letternun := 0; 
wordnum : = 0 ; 
stepcount := 0; 
case Inf o . Frequency of 

Frequently : threshhold := freqnum; 

Seldomly : threshhold := selnum; 

Hhen_Done : threshhold •. = WhenDonenum; 

end; cases X) 

offset := ord (’A») - ord C*a*); 
while not EOF CIntext) do begin 

while not EOLN (Intext) and Allswell do begin 
Read (Intext, c); 

if (c in ( . * A * . . * Z * . ) ) then c := chr(ord(c) - offset); 
if ((c in ( . ’ a ’ . . ’ z * . ) ) or (c = Succ(*3*)) 

or (c in ( . * 0 * . . ’ 9 ’ . ) ) ) then begin 
letternum : = letternum + 1; 

if (letternum <= wordlength) then n . word( . 1 etternum . ) := c; 

end; 

if (letternum > 0) then begin 

if (c = * *) or (c = ’-*) or EOLN (Intext) or 

(c = ’.*) or (c = ’?’) or (c = *!*) or (c = ’;*) then 
begin 

for i := letternum + 1 to wordlength do n . word( . i . ) : = * *; 

if (letternum > wordlength) then n. length := wordlength 
else n. length := letternum; 

Store_Intext (n); 

Wordnum •• = Wordnum + 1; 
letternum := 0; 

if (c = *.*) or (c = *?’) or (c = *!*) or (c = *;*) then 
if (wordnum > 1) then begin (X not embedded command X) 
Signa l_End_of_Sentence; 
wordnum *.= 0; 

end; 

stepcount := stepcount + 1; 

if (stepcount mod threshhold = 0) then begin 
Page; Writeln; Writeln; Writeln; Writeln; 

Write (’Stylist is now reading : ’); 

Writeln (Info. Name_of_text ) ; 

Writeln (’Last word read was ’, n.word); 

Writeln; Writeln; 

Write (’Stylist has read stepcount j 5 ) ; 

Writeln (’ words. ’);Writeln; 

Writeln; Writeln; 

Writeln ( S tepcount^d 00 div Info.Size;3, ’ X complete.’); 
end; (X if X) 
end; 

end; (^ letternum > 0 )() 
end; Readln (Intext); 
end; (^ while not EOF ^) 

Signa l_End_of_Fi 1 e; 
end; 



66 



readtype) ; 



procedure Store_Intext (n ; 
var CP : concordpointer; 
found : boolean; 
entry *. entrytype; 
begin 

tal ly . totalwords := tal ly . totalwords + 1; 
tally . totalletters := tal ly. total 1 etters + n. length; 
tally . numwordsthissent := tal ly . numwordsthissent + 1; 
found := false; 

if (n. length < 5) then Look_Up_Four (n, FourRoot/ found); 
if not found then 

tally . LtrsPerWordC . n . length . ) *. = tally . LtrsPerNordC . n . length . ) + 1; 
if not found then Look_Up (n^ Root); 
end; 



procedure Si gnal_End_of_Sentence; 
begin 

tal ly . totalsentences : = tal ly . totalsentences + 1; 
if ( tal ly. totalsentences > maxsent) then begin 

Writeln (*INTEXT TOO BIG.*); Allswell := false; 

end 

else tal ly . WordsPerSentC . tal ly. totalsentences . ) := 
tally. numwordsthissent ; 
tal ly . numwordsthissent := 0; 
end; 



procedure Signal_End_of_Fi le; 
begin 

if Inf o . Want_Concord then Lastwa rd^ . next := nil; 

ta 1 ly . numf ound := tal ly . totalwords - tal ly . unknownwords; 

Page; Writeln; Writeln; Writeln; Writeln; 

Write ('Stylist has finished reading : *); 

Writeln ( I nf o . Name_of_text ) ; 
end; 



procedure Look_Up_Four (n : readtype; var ancestor : entrypointer ; 

var found : boolean); 

var cp : concordpointer ; 

entry : entrytype; 
begin 

if (ancestor = nil) then found := false 
else if (ancestor^ .word = n.word) then begin 
found : = true; 

if (ancestor- .vigor <> inert) then Add_Values (ancestor); 
tally . totalstrucltrs := tal ly . totalstrucltrs + n. length; 
tal ly . structurewords := ta 1 ly . st ructurewords + 1; 
if Inf o . Want_Concord then 

Attach_Concord_Node (ancestor^ cp); 
end (^ if ^) 

else if ( ancestor- . word > n.word) then 

Look_Up_Four (n, ancestor- . 1 eft, found) 
else Look_Up_Four (n, ancestor- . right , found); 
end; 
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procedure Look_Up (n : readtype; var ancestor ; entrypoi nter ) 
var cp : concordpointer ; 
entry : entrytype; 

P ; entrypointer ; 
found : boolean; 
c : char; 
begin 

if (ancestor = nil) then begin 

L ook_Up_Variants in, p, found); 
if found then begin 

tal ly . variantwords ;= ta 1 ly . variantwo rds + 1; 

New (ancestor ) ; 
with ancestor- do begin 
word : = n . word; 
left := nil; right := nil; 
down : = nil; 

end; (X with ancestor do 50 
if (p-. status <> not_valuable) then begin 
Add_Values (p); 
with ancestor- do begin 
status : = not^storabl e; 
source ;= p-. source; 
difficulty := p- . dif f icul ty; 
concreteness := p- . concreteness; 
emotion := p-. emotion; 
vigor := p-. vigor; 
end; with ancestor do 5e) 
end else p-. status is other than not_valuable 5() 
ancestor- . status := not_valuabl e; 



end 

else if Inf o . Want_Dix_Grow then begin 
Repeat 

Page; Writeln; Writeln; Writeln; 

Nriteln (n.word; 50); Writeln; Writeln; 
Writeln (’ADD IT T':A5); Writeln; Writeln; 
Writeln (’1) Yes’: ^5); 

Writeln ( ’2) No ’ : A5); 

Read (c); Readln; 

Until (c = ’!’) or (c = ’2’); 
if (c = ’1’) then begin 
Entry. word := n.word; 

Get_Values_f or_New_Entry (Entry) ; 

Attach (Entry, ancestor); 

Tal ly . newwords := tally . newwords + 1; 
Add_Values (ancestor); 
end else Attach_No_Va lue (ancestor, n); 



end 



else Attach_No_Value (ancestor, n); 
if I nf o . Want_Concord then 

Attach_Concord_Node (ancestor, cp) ; 
end ( 5^ in ancestor = nil 5() 

else if (ancestor- . word = n.word) then begin 

if (ancestor- . status <> not_storabl e) then 

tal ly . knownwords := tal ly . knownwords + 1 
else tally . unknownwords := ta 1 ly . unknownwords + 1; 
if ( ancestor- . status <> Not_Valuable) then 
Add^Values (ancestor) ; 
if Inf o . Want_Concord then 

Attach Concord_Node (ancestor, cp); 
end if 50 

else if (ancestor- .word > n.word) then 
Look_Up (n, ancestor- . left ) 
else Look_Up (n, ancestor- . right ) ; 
end; 
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procedure Attach (Entry ; entrytype; var p : entrypoi nter ) 
begin 

New (p); 

with p-' do begin 

word := entry. word; 
status := storable; 
source := entry . source; 
difficulty := entry . difficulty; 
concreteness := entry . concreteness; 
emotion := entry . emoti on ; 
vigor := entry. vigor; 
left := nil; right := nil; 
down : = nil; 
end; 
end; 



procedure A ttach_Concord_Node (p : entrypointer ; 

var CP : concordpointer ) ; 

var Bottom, 

Bottommost : concordpointer ; 
begi n 

Bottom := p^.down; 
while (Bottom <> nil) do begin 
Bottommost ; = bottom; 

Bottom := Bo ttom- . down ; 
end; 

New (cp); 

With cp^ do begin 
Up := p; 

if (Lastward = nil) then Last := nil 
else last := Lastward; 

Down := nil; 
end; 

if (P-'.down = nil) then p- . down := cp 
else Bottommost- . down := cp; 
if (Lastward <> nil) then 
cp- . last- . next := cp; 
lastward := cp; 
end; 



procedure Attach_No_Value (var ancestor *. entrypointer ; 

n : readtype); 

begin 

Writeln (newwords, n.word); 

New (ancestor); Ancestor- . word := n.word; 
ta 1 ly . unknownwords := tal ly . unknownwords + 1; 
Ancestor- . right := nil; Ancestor- . 1 eft := nil; 
Ancestor- . down := nil; 

Ancestor- . status := not_valuable; 
end; 
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procedure Look_Up_Variants (original : readtype; var p : entrypointer ; 

var found : boolean); 

var variant : readtype; 

i : integer; 
begin 

found := false; 

if ( original . length > 3) then 

if ( original .wordC . original . length . ) = *s') then begin 
variant. word := ori gi na 1 . wo rd; 
variant . wordC . original . length . ) := * '; 

variant . 1 ength := original . length - 1; 

Look_Up_Va riant (variant/ Root/ found, p); 

if not found and (variant .word( .variant . length. ) = 'e') then 
begin 

va riant . word( . variant . length . ) := ' 

va riant . 1 ength := va riant . 1 ength - 1; 

L ook_Up__Va riant (variant. Root, found, p); 

if not found and (variant .word( . variant . length . ) = *i') then 
begin 

variant .word( .variant . length. ) := *y'; 

Look_Up_Variant (variant. Root, found, p); 
end; 
end; 

end 

else if ( original . 1 ength > 4) then 

if (original .word( . original . length. ) = 'g*) and 
( original . word( . original . length - 1.) = 'n*) and 
( original . word( . original . length - 2.) = 'i') then begin 
variant. word ;= original .word; 

for i := ori gina 1 . 1 ength - 2 to original . 1 ength do 
variant . word( . i . ) ;= ' '; 

variant . length := original . length - 3; 

L ook_Up_Variant (va riant , Root , found, p); 
if not found then begin 

variant . 1 ength := va riant . 1 ength + 1; 
var iant . word( . va riant . 1 ength . ) *. = 'e'; 

Look_Up_Variant (variant. Root, found, p); 
if not found and 

(variant .word( . variant . length - 2.) = 
variant . word( . variant . length - 1.)) then begin 
variant . 1 ength := va riant . 1 ength - 2; 
variant .word( .variant . length + 1.) := * '; 

va riant . word( . variant . 1 ength + 2.) := * 

L ook_Up_Va riant (variant. Root, found, p) ; 

end; 

end; 

end 
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else if ( original . length > 4) then 

if (original .wordC . original . length. ) = *d’) and 

(original .wordC . original . length - 1.) = *e*) then begin 
variant. word := original .word; 

for i := original . length - 1 to original . length do 
variant . word( . i . ) 

variant . length := origina 1 . length - 2; 

Look_Up_Variant (variant , Root , found, p); 
if not found then 

if (variant . word( . variant . 1 ength . ) = 'i*) then begin 
variant .word( .variant . length. ) := *y'; 

variant . length := va riant . length - 1; 

Look_Up_Variant (variant , Root , found, p); 
end; 

if not found then begin 

variant . length := variant . length + 1; 
variant . word( . variant . 1 ength . ) := 'e*; 

Look_Up_Variant (variant. Root, found, p); 
if not found and 

(variant . word( . variant . 1 ength - 2.) = 
variant .word( .variant . length - 1.)) then begin 
variant . length := va riant . 1 ength - 2; 
variant .word( . variant . length + 1.) := * 

variant .word( . variant . length + 2.) := ’ *; 

Look_Up_Variant (variant. Root, found, p); 

end; 

end; 

end 

else if ( original . length > 4) then 

if ( original .word( . original . length . ) = *y*) and 

( original . word( . original . length - 1.) = '!') then begin 
variant. word ;= ori gi na 1 . word ; 

for i := original . length - 1 to original . length do 
variant .word( . i . ) := * 
variant . length := ori gi nal . 1 ength - 2; 

L ook_Up_Variant (variant , Root, found, p); 
if not found then begin 

variant . length := variant . 1 ength + 2; 
variant . word( . variant . length . ) := 'e'; 

variant , word( . variant . length - 1.) := 

Look_Up_Variant (variant. Root, found, p); 
if not found and 

(variant .word( .variant . length - 2.) = *i') then begin 

variant . 1 ength := variant . length - 2; 
variant .word( .variant . length . ) := *yM 

variant .word( .variant . length + 1.) := • •; 
variant . word( . variant . length + 2.) := * •; 

Look_Up_Variant (variant. Root, found, p); 

end ; 
end; 

end; 

end; 
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procedure Look_Up_Va riant (variant : readtype; 

var ancestor : Entrypointer ; 
var Found : Boolean; 
var p : Entrypointer ) ; 

begin 

if (ancestor = nil) then found := false 
else if (ancestor^ .word = variant . word) then begin 
found ;= true; 
p := ancestor; 

end 

else if (ancestor-' .word > variant .word) then 

L ook_Up_Variant (variant, ancestor^ . left , found, p) 
else L ook_Up_Va riant (variant, ancestor — -.right, found, p) 
end; 



procedure Add_Values (p *. entrypoi nter ) ; 
begin 

if (p-. source = Latinate) then 

Tally . NumLatinate := Tally . NumLatinate + 1 
else Tally . NumGermanic := Ta 1 ly . NumGermani c + 
case p^ . dif f icul ty of 



1 ; 



PostGrad 
Grad 

High_School 
Elementary 
end; 

if ( . concreteness = Intangible) 
Tally . Numintangible := Tally 
else Tal ly . NumTangible := Tally 

case p^. emotion of 



Tally . NumPostGrad := Ta 1 ly . NumPostGrad + 1; 
Tally . NumGrad := Tally . NumGrad + 1; 

Tally . NumHigh_School := Tally . NumHigh^School 
Tal ly . NumEl ementary := Tally . NumElementary + 



then 

Numintangible + 1 
NumTangible + 1; 



+ 1 
1 ; 



Sublime 
Pleasant 
Neutra 1 
Unpleasant 
Horri d 
end; 

case p-. vigor 
Violent 
Energetic 
Calm 
Inert 
end; 
end; 



Tally. NumSublime 
Tally. NumPleasant 
Tally. NumNeutral 



= Tal ly. NumSublime + 1; 

:= Tally . NumPleasant + 1; 

;= Tal ly . NumNeutral + 1; 

Tal ly . NumUnpleasant := Ta 1 ly . NumUnpleasant + 1; 
Tally . NumHorrid := Tal ly . NumHorrid + 1; 



of 



Tal ly . Numviolent := tally . numviolent + 1; 
Ta 1 ly . NumEnergeti c := Tal ly . NumEnergeti c + 
Tally . NumCalm *. = Tal ly . NumCalm + 1; 

Tally . Numlnert := Ta 1 ly . Numlnert + 1; 



1 ; 
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procedure Get_Val ues_f or_New_Entry (var Entry •• entrytype); 
var c ; char; 

good_char : boolean; 
begin 

Repeat 

Page; 

Writeln; Writeln; Writeln; 

Writeln ('SOURCE Writeln; Writeln; 

Writeln ( Entry . word ; <^5 ) ; 

Writeln; Writeln; 

Writeln Cl) LATINATE *:A5); 

Writeln C2) GERMANIC »:A5); 

Read (c); Writeln (c:A5); Readln; 

Until (c = U') or (c = *2’); 
if (c = '1') then Entry. source := Latinate 
else Entry. source := Germanic; 

Repeat 

Page; 

Writeln; Writeln; Writeln; 

Writeln ('DIFFICULTY LEVEL *:A5); 

Writeln; Writeln; 

Writeln ( Entry . word : A5 ) ; Writeln; Writeln; 

Writeln Cl) POSTGRADUATE ^5 ) ; 

Writeln C2) GRADUATE ’;A5); 

Writeln C3) HIGH SCHOOL »:A5); 

Writeln CA) ELEMENTARY »:A5); 

Read (c); Writeln (c;A5); Readln; 

Until (c = '!') or (c = '2') or (c = *3') or (c = 'A'); 
case c of 

: Entry . difficulty := PostGrad; 

'2' : Entry . difficulty ;= Grad; 

•3' : Entry . di ffi cul ty := High_School; 

'A' : Entry . difficulty := Elementary; 

end; 

Repeat 

Page; Writeln; Writeln; Writeln; 

Writeln (’CONCRETENESS ’:A5); Writeln; Writeln; 
Writeln ( Entry . word : A5 ) ; Writeln; Writeln; 

Writeln Cl) TANGIBLE ’:A5); 

Writeln C2) NOT TANGIBLE ’:A5); 

Read(c); Readln; Writeln; Writeln (c:A5); 

Until (c = ’!*) or (c = ’2*); 

if (c = '!') then Entry . concreteness := Intangible 
else Entry . concreteness := Tangible; 



Repeat 

Page; Writeln; Writeln; Writeln; 
Writeln (’EMOTIONAL CONNOTATION ’ 



Writeln 
Wri tel n 
Wri teln 
Writeln 
Wri teln 
Writeln 
Read( c) ; 

Until (c = 



(Entry. word: A5) 



’ 1 ) 

* 2 ) 

’3) 

’A) 

’5) 



Readln; 
' 1 ') 



SUBLIME 
PLEASANT 
NUETRAL 
UNPLEASANT 
HORRID 

Writeln ; 
(c = ’2’) 



or 



Writeln; 

A5). 

A5) 

A5) 

A5) 

A5); 
Writeln 
or ( c = 



:A5); Writeln; 
Writeln; 



(c:A5); 

’3*) or (c = 



case c of 

’1* : Entry . emotion 
*2* Entry . emotion 
’3’ : Entry . emotion 
’A’ : Entry . emotion 
’5* : Entry . emotion 



Sublime; 
Pleasant; 
Neutral ; 
Unpl easant ; 
Horri d; 



end; 



Writeln; 



’A’)or(c = 



’5’ ); 
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end; 



Repeat 

Page; Writeln; Writeln; Writeln; 

Writeln ( ' VIGOR * : ^5 ) ; Writeln; Writeln; 
Writeln ( Entry . word : 45 ) ; Writeln; Writeln; 
Writeln Cl) VIOLENT ' :45); 

Writeln C2) ENERGETIC *:45); 

Writeln ( '3) CALM • :45) ; 

Writeln ( '4) NONE ' :45); 

Read (c); Writeln (c:45); Readln; 

Until (c = '!*) or (c = *2*) or (c = *3*) or (c 
case c of 



* 1 * : Entry . vi gor 

*2* : Entry. vigor 

'3* : Entry. vigor 

'4* : Entry. vigor 

end; 



Violent; 

Energetics- 

Calm; 

Inert ; 



* 4 * ) ; 



procedure Wri te_Concordance; 
begi n 

Page; Writeln; Writeln; Writeln; Writeln; 

Write ('Stylist is now writing the concordance for '); 

Writeln ( I nf o . Name_of_text ) ; 

Writeln; Writeln; 

Writeln ('Concordance will be written to "Concrdnc text a"':50); 
Writeln; Writeln; 

Writeln (Concrdnc); Writeln (Concrdnc); 

Writeln (Concrdnc, ' CONCORDANCE 55 ) ; 

Writeln (Concrdnc, Inf o . Name_of_text : 55) ; 

Writeln (Concrdnc); Writeln (Concrdnc); 

I norder_Concordance (Root); 
end ; 
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procedure Inorder_Concordance (var Root : entrypointer) ; 



( 3 ( 

(X 

var 



This is the most complex procedure in the program. It traverses 
the Dixonary Binary Search Tree (BST) in inorder fashion, this 
visiting each important word used in the Intext in alphabetic 
order. During the visit to each word, it goes down the concord 
list one at a time, thus examining each use of that word in the 
order it was used in the Intext. At each visit down the concord x) 



list, it follows the Lastward linked list to find the phrase in 
the Intext that preceded that instance of the use of the word, 
prints the phrase, capitalizes and prints the word, then follows 
the Nextward linked list to find and print the phrase following. 



concordpointer; 

Phrasetype; 



(X 



Lastward, 

Nextward, 

Downward 
Phrasel , 

PhraseZ 
w, 

1 , 

offset , 
i : integer; 
c : char; 
begin 

offset := ord (’A’) - ord (*a*); 
if (Root <> nil) then begin 

Inorder_Concordance ( Root- . 1 eft ) ; 

Downward := Root-. down; 

Nhile (Downward <> nil) do begin 
w : = 0 ; 

Lastward := Downward- . last ; 

while (Lastward <> nil) and (w < Phrasel ength) do begin 
w : = w + 1 ; 

PhraseK.w.) := Lastward- . up- . word; 

Lastward := Lastward- . last ; 
end; 

for i := w downto 1 do begin 
1 := 1 ; 

while (1 <= wordlength) do begin 
if (Phrasel(.i.)(.l.) <> • «) then 

Write (Concrdnc, Phrasel (.i.)(.l.)); 

1 := 1 + 1 ; 



X) 

^) 

X) 

^) 

^) 



(^ pointer to word previous in intext 
(^ pointer to word following in intext^) 
(^ pointer to next use of same word ^) 
phrase preceding key word in concord^) 
phrase following key word in concord^) 

# of word being printed ^) 

# of letter in word being printed X) 



• : 1 ); 
•); 



end; 

Write (Concrdnc, 
end; 

Write (Concrdnc, ’ 

1 := 1 ; 

while (1 <= wordlength) do begin 
c := Root- . wordC . 1 . ) ; 
if (c in ( . *a * . . * z* . ) ) then 
c := chr(ordCc) + offset); 
if ( Root- . word( . 1 . ) <> ’ ’) then 

Write (Concrdnc, c); 

1 := 1 + 1 ; 



end; 

Write (Concrdnc, * *); 

w t = 0 ; 

Nextward := Downward- . next ; 
while (Nextward <> nil) and 

(w < Phraselength) do begin 
w : = w + 1 ; 

PhraseZC.w.) ;= Nextward- . up- .word; 
Nextward := Nextwa rd- . next ; 
end; 
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for i : = 1 to w do begin 
1 := 1 ; 

while (1 <= wordlength) do begin 

if (Phrase2(.i.)(.l.) <> ’ then 

Write (Concrdnc, PhraseZC . i . ) ( . 1 . ) ) ; 
1 := 1 + 1 ; 
end; 

Write (Concrdnc^ ’ *:1); 

end; 

Writeln (Concrdnc); 

Downward := Downward- . down; 
end; 

Inorder_Concordance ( Root- . right) ; 
end; if Root <> nil 
end; ( 3( procedure I norder__Concordanc e 
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procedure Calculate_Prof i 1 e; 

var numsigltrs : integer; number of significant letters 

numsigwrds : integer; number of significant words X) 

in this context, structureal words are insignificant 50 
begi n 

if (tally. totalsentences = 0) then tal ly . totalsentences := 1; 
if ( tally . totalwords = 0) then tal ly . totalwords := 1; 
numsigwrds := ta 1 ly . totalwords - tal ly . structurewords; 
numsigltrs := tal ly . totall etters - tal ly . totalstrucl trs; 

Profile ;= tally; 

profile. Ave_ltrs_per_word := numsigl trs/numsigwrds; 

profile . Ave_wrds_per_sent ; = tally. total words/ tal ly . totalsentences; 
end; 



procedure Analyze_Prof i 1 e_and_Report ; ( main Reporter procedure 50 

begin 

Page; Writeln; Writeln; Writeln; Writeln; 

Nrite ('Stylist is now analyzing the style of O; 

Writeln ( Inf o . Name_of_text ) ; 

Writeln; Writeln; 

Writeln ('Report of analysis will be written to '’Report text a'":50); 
Writeln; Writeln; 

Writeln (Report); Writeln (Report); 

Writeln (Report); Writeln (Report); 

Calculate_Size_of_Text (profile, analysis); 

Calculate_Length_of_Words (profile, analysis); 

Ca 1 cu la te_L eng th_of_Sen fences (profile, analysis) ; 

Calculate_Etymology (profile, analysis); 

Calculate_Dif ficulty_of_Vocabulary (profile, analysis); 
Calculate_Tangibility (profile, analysis) ; 

Calculate_Emotional_Tone (profile, analysis); 

Calculate_Vi gor_of_Words (profile, analysis); 

Make_Recommendations (analysis) ; 
end; 
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procedure Calculate_Size_of_Text (profile : profiletype; 

var analysis ; ana lysistype) ; 

var i , 

PerCentFound : integer; 
begin 

Writeln (Report) ; Wri tel n( Report ) ; Writeln(Report); 

Write (Report, 'Total of sentences : *); 

Writeln (Report, prof i 1 e . Tota Isentences) ; 

Write (Report, 'Total of words : '); 

Writeln (Report, profile. Tota 1 words) ; 

Write (Report, 'Total of letters : '); 

Writeln (Report, prof i 1 e . tota 1 1 etters) ; 

Writeln (Report); Wri teln( Report) ; Wri tel n ( Report ) ; 
if ( prof i le . totalwords < 500) then begin 

Write (Report, P rof i 1 e . to ta Iwords , ' words are '); 

Writeln (Report, 'too few for valid statistical analysis.'); 

end 

else if ( profi le . totalwords < 1000) then begin 

Write (Report, Prof i 1 e . to ta Iwo rds , ' words are'); 

Writeln (Report, ' enough for valid statistical analysis.'); 



end 

else begin 

Write (Report, Prof i le . totalwords, ’ words are '); 

Writeln (Report, ' plenty for valid statistical analysis.'); 
end; 

Writeln ( Report ) ; Writeln(Report); Wri tel n( Report ) ; 

Write (Report, 'Of the ', prof i 1 e . totalwords : 6 , ' words in '); 

Writeln (Report, Inf o . Name_of_Text , ','); 

Write (Report, prof i 1 e . numf ound : 6 ) ; 

Writeln (Report,' were matched to words in the Stylist dictionary.'); 
percentfound := prof i 1 e . numf ound ^ 100 div prof i 1 e . tota Iwords ; 

Writeln (Report); 

Writeln (Report, Percen tf ound : 2, ' V* were matched.'); 

if (percentfound < 50) then begin 

Write (Report, I nf o . Name_of_text ) ; 

Writeln (Report, ' must contain many specialty or unique words.'); 
Writeln (Report, 'Statistical analysis is not valid.'); 



end 

else Writeln (Report, 'This is enough for statistical analysis.'); 
if ( prof i le . newwords > 0) then begin 

Write (Report, 'You added ', prof i 1 e . newwords ) ; 

Writeln (Report,' to the dictionary during this session.'); 
end; 

end; (^ procedure Calculate Size of intext ^) 
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procedure Calculate_Length_of_Words (profile : profiletype; 

var analysis : anaiysistype) 

begin 

Graph_L trsPerWord ( profile . 1 trsperword) ; 

Writeln (Report); Writeln (Report); 

Write (Report/ 'The average number of letters per word : '); 

Writeln ( Report/ profile . Ave_l trs_per_word ) ; 

Wri tel n( Report ) ; Writeln(Report); 

if (Info. Genre = NonFiction) then begin 

Writeln (Report/ 'A typical nonfiction texts distribution ') 
Write (Report/ 'resembles a low bell-shaped curve '); 

Writeln (Report/ 'centered around six 1 etters/word . ' ) ; 
if ( prof i 1 e . Ave_L trs_Per_Word > 7) then 
Ana lysis . WordL ength := Toolong 
else if ( prof i 1 e . Ave_L trs_Per_Word > 6.5) then 
Analysis . WordLength := Long 
else if ( prof i le . Ave_L trs_Per_Word > 5.5) then 
Analysis . WordLength := medium 
else if ( prof i le . Ave_L trs_Per_Word > 5) then 
Analysis . WordLength := short 
else Analysis . WordLength := tooshort 

end 

else begin (^ genre is fiction ^) 

Writeln (Report/ 'A typical fiction texts distribution '); 
Write (Report/ ' resembles a tall bell-shaped curve '); 
Writeln (Report/ 'centered around five letters/word . ' ) ; 
if ( prof i 1 e . Ave_L trs_Per_Word > 6) then 
Analysis . WordLength := Toolong 
else if ( profile .Ave_Ltrs_Per_Word > 5.5) then 
Analysis . WordLength := Long 
else if ( prof ile . Ave_Ltrs_Per_Word > 5) then 
Analysis . WordLength *.= medium 
else if ( prof ile . Ave_Ltrs_Per_Word > 4) then 
Analysis . WordL ength := short 
else Ana lysis , WordL ength := tooshort; 

end; 

Writeln ( Report) ; 

Write (Report/ 'The length of the words is '); 

case Analysis . WordLength of 

TooShort : Writeln (Report/ 'too short.'); 

Short : Writeln (Report/ 'short.'); 
medium : Writeln (report/ 'medium'); 

Long : Writeln (Report/ 'long.'); 

Toolong : Writeln (Report/ 'too long.'); 

end; 

Writeln (Report); Wri teln ( Report ) ; Wri teln( Report ) ; 
end; (^ calculate length of words 3() 
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procedure Graph_L trsperword (Itrsperword : 1 trsperwordtype) ; 
van perltrwords : It rsperwordtype; 

i, j : integer; 
begin 

Page (Report); 

WritelnC Report ) ; Wri tel n( Report ) ; Wri tel n( Report ) ; 

Writeln ( Report BREAKDOWN OF PERCENT OF LETTERS PER WORD*:60); 

W rit el n( Report ) ; Wri tel n( Report ) ; 
for i : = 0 to wordlength do 

Perl trNordsC . i . ) := ( L trsPerWordC . i . ) ?^1 00 ) div 

(tally . total words -tal ly . struct urewords) ; 
for i : = 15 downto 1 do begin 
Write ( Report / i^2 : 2/ * 

for D := 1 to wordlength do 

if ( PerLtrWords( . j . ) >= i^2) then 
Write (Reports* X *:5) 

else if ( Perl trWords( . j . ) >= (i^2)-l) then 
Write (Report,* . *:5) 

else 

Write (Report,* *:5); 

Writeln (Report); 
end; 

Write (Report,* *:<^); 

for i : = 1 to wordlength do 

Write ( Report , i : 5 ) ; 

Writeln (Report); 

Writeln ( Report ,* Number of Letters in a Word'-.^^S); 
end; 
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procedure Calculate_L ength_of_Sentences (profile : profiletype; 

var analysis : ana lysis type ) 

var 

i / 

Sweetspots/ 

Runons/ 

PerCentRunons> 

PercentSweetSpo ts : integer; 
begin 

Graph_WordsPerSent ( prof i 1 e . Words Per Sent) ; 

Write (Report/ *The average number of words per sentence : *); 

Nriteln (Report/ prof i le . Ave_wrds_per_sent ) ; 

Wri teln( Report ) ; Wri tel n ( Report ) ; Wri tel n( Report ) ; 

Write (Report/ *A typical modern texts sentences average *); 

Writeln (Report/ * between fifteen and twenty words.*); 
if ( prof i 1 e , ave_wrds_per_sent > 22) then 
Analysis . SentLength := Toolong 
else if ( prof i le . ave_wrds_per_sent > 18) 

Analysis . Sent! ength := Long 
else if ( prof i le . ave_wrds_per_sent > 15) 

Analysis . SentLength := Medium 
else if ( prof i le . ave_wrds_per_sent > 10) 

Analysis . SentLength := Short 
else Analysis . SentLength := TooShort; 

Writeln (Report); 

Write (Report/ 'Sentences are '); 
case analysis . SentLength of 

TooShort : Writeln (Report/ *too short. 

Short : Writeln (Report/ 'short.*); 
medium : Writeln (report/ 'medium'); 

Long J Writeln (Report/ 'long.'); 

Toolong : Writeln (Report/ 'too long.*); 
end; 

Sweetspots := 0; 

Runons : = 0 ; 

for i : = 1 to profile . totalsentences do 

if (profile. wordspersent( .i . ) > A5) then 
runons ;= runons + 1 

else if ( prof il e . wordspersent ( . i . ) > 8) and 

( profi le . wordspersent( . i . ) < 20) then 
sweetspots *.= sweetspots + 1; 

PerCentRunons := runons^lOO div prof i 1 e . tota Isentences; 
if ( PercentRunOns > 5) then Ana lysis . Runons := unacceptable 
else if ( PercentRunOns > 0) then Ana lysis . Runons := acceptable 
else Ana lysis . Runons := nonexistant; 

Writeln (Report); 

' / runons : 4 ) ; 

1 



then 

then 

then 



) 



( Report/ 

( report / 

( Report ) ; 
(Report/ 



'Number of run ons 
'Percent of run ons 



percent runons : A) ; 



' Run ons are ' ) , 



Writeln 
Writeln 
Writeln 
Write 

case Analysis . Runons of 

Nonexistant : Writeln (Report/ 'nonexistent.'); 

Acceptable : Writeln (Report/ 'acceptable.'); 

Unacceptable : Writeln (report/ 'unacceptable'); 
end; 

PerCentSweetSpots := sweetspots^l 00 div profile. totalsentences; 
if (PercentSweetspots > 50) then Analysis . Module ti on := Good 
else if (PercentSweetspots > 20) then Analysis . Modulati on := Average 
else Ana lysis . Modulati on := bad; 

Writeln (Report); 

Writeln (Report/ 'Number of medium length sentences */ sweetspots : A ) 
Write (Report/ 'Percent of medium length sentences *); 

Writeln ( Report/ percentsweetspots : A) ; 

Writeln (Report); 
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Write (Report, 'Modulation is 'In- 
case Analysis . Modulation of 

Good : Writeln (Report, ' good.'); 
Average : Writeln (Report, 'average'); 
Bad ; Writeln (report, 'bad'); 
end; 

end; procedure analyze length of sentences 
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procedure Graph_WordsPerSent (Graph : WordsPerSenttype ) ; 

var i, 3, pagenum : integer; 

begin 

Pagenum := 1; 

While (Tally . totalsentences > (Pagenum - 1) ^ 70) do begin 
Page (Report); 

Writeln ( Report, * BREAKDOWN OF NUMBER OF WORDS PER SENTENCE* : 60); 



Writeln ( Report Number of words*); 
for i ; = 1 ((Pagenum “ 1) ^ 70) to 70 ^ 
Graph(.i.) := (Graph(.i.) div 2); 
for i := 50 downto 1 do begin 
Write ( Report , i5(2 : 2, * *:1); 

for j := 1 ^ ((Pagenum -1) ^ 70) to 70 
if (Graph(.j.) >= i) then 

Write (Report, *X*:1) else Write 
Writeln (Report); 
end; 

Write (Report, * *); 

for i ;= 1 to 7 do 

Write (Report, i 10 + ((Pagenum - 1) 
Writeln (Report); 

Writeln (Report, *Sentence Number* :45); 
Writeln (Report); 

Pagenum := Pagenum + 1; 
end; 
end; 



(Pagenum) do 

^ (Pagenum) do 
( Report, * * : 1 ) ; 

X 70):10); 



procedure Calculate_Etymology (profile ; profiletype; 

var analysis : analysistype) ; 



begin 

Writeln ( Report) ; Writeln(Report); Writeln( Report); 

Writeln (Report, *ETYM0L0GY OF WORDS* :^5); 

Wri tel n( Report) ; Writeln(Report); 

Write (Report, *Number of Latinate words : *); 

Writeln (Report, profile . NumLatinate) ; 

Write (Report, *Number of Germanic words : *); 

Writeln (Report, prof ile . NumGermanic) ; 

Writeln (Report); 

if (Info. Genre = NonFiction) then begin 

if ( prof i 1 e . NumLati nate > prof i 1 e . NumGermani c ^ 3) then 
Analysis . Etymology TooBorrowed 
else if ( profile . NumLatinate > prof i 1 e . NumGermani c 2) then 
Analysis . Etymology := Borrowed 
else if ( prof i le . NumLatinate > prof i le . NumGermani c) then 
Analysis . Etymology := Mixed 

else if ( prof i le . NumLati nate > prof i le . NumGermani c X 0.9) then 
Ana lysis . Etymology := Native 
else Analysis . Etymology := TooNative; 

end 

else (X genre = Fiction begin 

if (profile. NumLatinate > prof i le . NumGermanic ^ 2) then 
Ana lysis . Etymology := TooBorrowed 
else if (profile . NumLatinate > profi 1 e . NumGermani c ^ 1.5) then 
Analysis . Etymology := Borrowed 
else if ( profi le . NumLatinate > profi le . NumGermani c ) then 
Analysis . Etymol ogy := Mixed 

else if (profile . NumLatinate > prof i le . NumGermani c ^ 0.5) then 
Analysis . Etymology : = Native 
else Analysis . Etymology := TooNative; 
end; 

Write (Report, *Etymology is *); 
case Analysis . Etymology of 

TooBorrowed : Writeln (Report, *very borrowed.*); 

Borrowed : Writeln (Report, * borrowed .*) ; 

Mixed ; Writeln (report, *mixed*); 

Native : Writeln (Report, ’native.*); 

TooNative : Writeln (Report, ’very native.’); 
end; 

end; (X procedure calculate etymology X) 



83 



procedure Calculate^Dif f icul ty_of_Vocabulary (profile 

var analysis : 

var 

PerCent Post Grad/ 

PerCentGrad/ 

PerCentHigh : integer; 
begin 

WritelnC Report ) ; Wri telnC Report ) ; 
'DIFFICULTY OF VOCABULARY •: ) ; 

Nri tel n( Report ) ; 

PostGraduate difficulty : *); 

profile. NumPostGrad) ; 

Graduate difficulty : 

profile. NumGrad) ; 

High School difficulty : *); 

profile. NumHigh_School ) ; 

Elementary difficulty : 

profile . NumEl ementary ) ; 

WritelnC Report); 

prof i 1 e . numpostgrad^l 0 0 div profi 
percentGrad := prof i 1 e . numgrad^l 0 0 div prof i 1 e . numf o 
percenthigh := profil e . numhigh_school^l 00 div profil 
if ( percentPostgrad > 0) then ana lysis . Ha rdWords 
else if (percentgrad > 5) then analysis . HardWords := 
else if (percenthigh > 10) then analysis . HardWords : 
else ana lysis . HardWords easy; 

Write (Report/ 'Percent of Postgraduate difficulty 
Writeln (Report/ percentpostgrad) ; 

Writeln (Report/ 'Percent of Graduate difficulty 
Writeln (Report/ 'Percent of High School difficulty 
WritelnC Report ) ; Wri tel n ( Report ) ; 

Write (Report/ 'Difficulty is '); 
case Analysis . HardWords of 

Veryhard: Writeln (Report/ 'very hard'); 
hard: Writeln (Report/ 'hard'); 
challenging: Writeln (report/ 'challenging'); 
easy : Writeln (Report/ 'easy.'); 
end; 

end; (^ Cal culate_Difficul ty_of_Vocabulary 



: profiletype; 
analysistype) ; 



Writeln (Report); 
Writeln (Report/ 
WritelnC Report); 
Write (Report/ ' 
Writeln (Report/ 
Write (Report/ ' 
Writeln (Report/ 
Write (Report/ ' 
Writeln (Report/ 
Write (Report/ ' 
Writeln (Report/ 
WritelnC Report); 
percentpostgrad 



1 e . numf ound; 
und; 

e . numf ound; 
veryhard 
hard 

= challenging 
*); 

*/ percentgrad) ; 
' / percenthigh) 



X) 



8 ^ 



procedure Cal culate^Tangibi 1 ity (profile : profiletype; 

var analysis : analysistype) ; 

begin 

Writeln (Report); Writeln( Report ) ; Wri teln( Report ) ; 

Writeln (Report. * TANGIBI LITY » : ^5) ; 

Nri tel n( Report ) ; Wri tel n( Report ) ; 

Write (Report, * Number of Tangible words ;*); 

Writeln (Report, profile. NumTangible) ; 

Write (Report, ' Number of Intangible words :*); 

Writeln ( Report, profile. Numlntangibl e) ; 
if (Info. Genre = Nonfiction) then begin 

if ( profi le . Numlntangibl e > prof i 1 e . NumTangibl e X 4) then 
Analysis . Tangibi lity := Soft 

else if ( profi le . Numintangible > prof i 1 e . NumTangible ^ 2) then 
Analysis . Tangibil ity := Firm 
else Analysis . Tangibility := Solid; 



end 

else genre = Fiction ^) begin 

if (profile . Numintangible > profi le . NumTangible X 3) then 
Ana lysis . Tangi bi li ty := Soft 

else if ( profil e . Numintangible > profile . NumTangible ^ 1.5) then 
Analysis . Tangibi 1 ity := Firm 
else Analysis . Tangi bi 1 i ty := Solid; 
end; 

Writeln (Report); 

Write (Report, ’Tangibility is *); 
case Analysis . tangibi lity of 

Solid : Writeln (Report, ' very tangibible . * ) ; 

Firm : Writeln (Report, * tangibible.’); 

Soft : Writeln (Report, ' very intangi bible . ’ ) ; 
end; 

end; (X calculate tangibility X) 
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procedure Calculate_Emotional_Tone (profile : profiletype; 

var Analysis : ana lysistype ) ; 

var 

PerCent Sublime, 

PerCentPl easant , 

PerCentUnpleasant , 

PerCentHorrid, 

Emotion Index : integer; 
begin 

Nriteln (Report) 

Hriteln (Report, 

Nri teln( Report ) ; 

Write (Report, • 

Nriteln (Report, 

Write (Report, ’ 

Nriteln (Report, 

Write (Report, ’ 

Nriteln (Report, 

Write (Report, • 

Nriteln (Report, 

Write (Report, • 

Nriteln (Report, 

Nri tel n( Report ) ; 



; Nri tel n( Report ) ; Writeln( Report ) ; 

'EMOTIONAL CONNOTATIONS A5) ; 

Writeln(Report); 

Sublime connotations : '); 

profile. NumSubl ime) ; 

Pleasant connotations : '); 

profile. Numpleasant) ; 

Neutral connotations : '); 

profi le . NumNeutral) ; 

Unpleasant connotations : ' ) ; 

profile. Numunpleasant); 

Horrid connotations : '); 

profile. NumHor rid); 

Nri tel n( Report ) ; W rit el n( Report ) ; 

PercentSubl ime := prof i 1 e . numsubl ime5(l 0 0 div prof i le . numf ound; 
percentPl easa nt := prof i le . numpleasant^^l 00 div prof i 1 e . numf ound; 
percentUnpl easant := prof i 1 e . numunpl easant^l 0 0 div prof i 1 e . numf ound; 



percenthorrid 
Emo ti onindex 



profile . numhorrid^lOO div prof i 1 e . numf ound; 

( percentsublime 5^ 5) + 

( percentpl easant 2 ) + 

( percentunpl easa nt 5^ 2) + 

( percenthorrid ^ 5); 

if ( Emotionindex > 20) then Analysis . emoti onali ty := rich 
else if (Emotionindex > 10) then Analysis . emotiona li ty := 
else Analysis . emotionality := poor; 

Nriteln (Report, 'Percent of sublime connotations ' , PercentSublime) ; 
'Percent of pleasant connotations ' 

'Percent of unpleasant connotations '). 

percentUnpl easant); 

'Percent of horrid connotations 



Nriteln (Report, 

Write (Report, 

Nriteln (Report, 

Nriteln (Report, 

Nriteln (Report); 

Nriteln (Report, 

Nriteln (Report); 

Write (Report, 

case Analysis . Emotiona lity of 

Rich: Nriteln (Report, 'Rich'); 



standard 



. percentPleasant ) ; 



'Index of Emotionality 
'Emotionality is '); 



' , percenthorrid) ; 
Emotionindex) ; 



Poor: Nriteln (report, 'poor'); 
end ; 

if (((percentsublime ^ 5) +percentpl easant ) > 

( ( percenthorrid 5) + percentunpl easant ) ^ 1.2) then 
Ana lysis . Tone := positive 

else if ( ( ( per centsublime ^ 5) +pe rcentpl easant ) ^ 1.2 < 

( ( percenthorrid ^ 5) + percentunpleasant ) ) then 
Ana lysis . Tone := negative 
else Analysis . Tone := bland; 

Nriteln (Report); 

Write (Report, 'Tone is '); 
case Ana lysis . Tone of 

Positive: Writeln (Report, 'Positive'); 

Bland: if (analysis . emotionality= rich) then begin 

Write (Report, 'a balance of strong positive '); 
Writeln (Report, 'and strong negative emotions.'), 
end else Writeln (Report, 'bland.'); 
negative: Writeln (report, 'negative'); 
end ; 

end; (^ procedure calculate emotional tone 
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procedure Cal cula te_Vi gor_of_Words 
var 

PerCentVi olent / 

PercentEnergeti C/ 

PercentCalm/ 

Strengthindex : integer; 
begi n 

Writein (Report) 

Writeln (Reports 
Wri tel n( Report ) ; 

Write (Report/ * 

Writein (Report/ 

Wri te( Report / * 

Writein (Report/ 

Write (Report/ ’ 

Writein (Report/ 

Write (Report/ * 

Writein (Report/ 

Writeln( Report); 
percentViolent ;= 
percent Energetic 



(profile J profiletype; 
var analysis ; ana lysistype ) ; 



Wri tel n( Report) ; 
;< 45 ); 



), 



; Wri teln( Report ) . 

•VIGOR OF WORDS' 

Wri teln(Re port); 

Words of Extreme Vigor : •); 

profile . NumViolent) / Writein; 

Words of Much Vigor : •); 

profile. NumEnergeti c ) ; 

Words of Some Vigor ’ ' 

profile. NumCalm) ; 

Words of Little Vigor ; ' 

profile. numinert); 

Wri tel n( Report ) ; Writein ( Report ) ; 
profile . numviolent^lOO div profile . numfound; 

:= prof i 1 e . numenergeti c3^1 00 div prof i le . numfound; 
percentCalm := profi le . numcalm^^lOO div prof i 1 e . numfound; 
Strengthindex : = percentviol ent 10 + 

percentenergeti c 5 + 

percentcalm; 

if ( Strengthlndex> 60) then Ana lysis . St rength := VeryStrong 



); 



else if (Strengthindex > 50) then 
else if (Strengthindex > 20) then 
else Analysis . Strength := weak; 



Analysis . Strength 
Analysis . Strength 



Write 
Writein 
Write 
Writein 
Wri te 
Writein 
Writein 
Writein 



( Report / 

( Report/ 

( Report/ 

( Report/ 

( Report/ 
(Report/ 

( Report) ; 
( Report/ 
Wri tel n( Report ) 
Write (Report/ 



•Percent of words of 
percentViolent) ; 
•Percent of words of 
percentEnergetic) ; 
•Percent of words of 
percentcalm) ; 

• Index of Vigor • j 

; 

•Vigor is •); 



extreme vigor : 
much vigor : 
some vigor 

Strengthindex) ; 



strong 

lively 

•); 

•); 

»); 



case Analysis . Strength of 

VeryStrong; Writein (Report/ 'very strong'); 
strong; Writein (Report/ 'strong’); 
lively: Writein (report/ 'lively'); 
weak ; Writein (Report/ ’weak.'); 
end; 

end; (^ procedure calculate vigor of words ^) 



procedure Wri te_NumRecom (Var Numrecom ; integer); 
begin 

NumRecom ;= NumRecom + 1; 

Wri tel n( Report ) ; Wri teln( Report) ; 

Writein (Report, 'RECOMMENDATION NUMBER ';55/ NumRecom;2); 
Writein (Report); Wri tel n( Report) ; 
end; 
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procedure Make_Recommenda ti ons (analysis : ana lysistype ) ; 
var NumRecom : integer; (3( # of recommenda ti ons made so far 
begin 

Writeln (Report); Writeln (Report); 

NumRecom : = 0 ; 
with analysis do begin 

if (Runons = Unacceptable) then begin 
Wri te_NumRecom (NumRecom); 

Writeln (Report, ’You tend to write run-on sentences . * ) ; 

Writeln (Report, ’Check your longest sentences for run ons.’); 
Writeln (Report, ’Break them up into units of single ideas.’); 

end; 

if (SentLength = TooLong) and (HardWords > Easy) then begin 
Wri te_NumRecom (NumRecom); 

Write (Report, ’Your average sentences are too long ’); 

Writeln (Report, ’for the difficulty of your vocabulary.’); 
Writeln (Report, ’Use simpler words or shorter sentences.’) 

end; 

if (Modulation = Bad) then begin 
Wri te_NumRecom (NumRecom); 

Write (Report, ’Your sentences tend to be too short or too’); 
Writeln (Report, ’long. Try to moderate and modulate ’); 

Writeln (Report, ’the length of your sentences . ’ ) ; 

end; 

if (Etymology = TooBorrowed) then begin 
Wri te_NumRecom (NumRecom); 

Writeln (Report, ’Use shorter, more native English words.’); 

end; 

if (Emotionality = Poor) then begin 
Wri te_NumRecom (NumRecom); 

Writeln (Report, ’Use more words that provoke emotions.’); 

end; 

if (Info. Genre = Fiction) then begin 
if (Strength < Strong) then begin 
Wri te_NumRecom (NumRecom); 

Writeln (Report, ’Use more evocative, sensory words.’); 
end; 

if (Etymology < Mixed) and (Hardwords > Challenging) and 
(SentLength > Medium) then begin 
Wri te_NumRecom (NumRecom); 

Writeln (Report, ’Your fiction reads like non-fiction.’); 
Writeln (Report, ’Unless you are targetting a highly ’); 
Writeln (Report, ’literate audience, keep it simple.’); 
end; 

if (Tangibility = Soft) then begin 
Wri te_NumRecom (NumRecom); 

Write (Report, ’A narrative should be concrete and ’); 
Writeln (Report, ’detailed. Describe things, not ideas. ’); 
end; 

end else begin (^ genre is nonfiction 

if (Etymology = TooNative) and (Strength = VeryStrong) then begin 
Wri te_NumRecom (NumRecom); 

Writeln (Report, ’Your non-fiction reads like fiction.’); 

Write (Report, ’Ask yourself if it is too vigorous ’); 
Writeln (Report, ’ for the audience that will read it.’); 
end; 

if (Strength = Weak) then begin 
Wri te_NumRecom (NumRecom); 

Writeln (Report, ’Use more evocative, sensory words.’); 
end ; 
end; 
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if (Strength > Lively) and (Emotionality > Standard) and 
(Modulation = Good) then begin 

Write (Report, 'Congratulations! You write with *); 

Writeln (Report, 'strength and grace. '); 
end; • 

if (Runons = nonexistant) then begin 

Write (Report, 'Congratulations! You never seem '); 

Writeln (Report, 'to write run-on sentences.'); 

end; 

If (NumRecom = 0) then begin 
Writeln (Report); 

Write (Report, 'This is a solid piece of writing '); 

Writeln (Report, 'well within the traditions of its genre.'); 
Writeln (Report); 

Write (Report, 'You are as able to understand the meaning of); 
Writeln (Report, ' the above characteristi cs as The Stylist.'); 
Writeln (Report, 'Stand the course!'); 

end; 

end; (5C with analysis do ^) 

end; (^ procedure make recommendations X) 
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procedure Stor e_New_Dix; 
begin 

Rewrite (Dixonary, 'Dixonary text a'); 

Page; Writeln; Writeln; Writeln; Writeln; 

Write ('Stylist is now storing the new dictionary 55) ; 
Store^BST (Root); 
end; 



procedure Store^BST (p : entrypointer ) ; 
begin 

if (p <> nil) then begin 

if (p-. status = storable) then Store (p^)/ 
Store_BST (p-.left); 

Store_BST (p- . right) ; 
end; 
end; 



procedure Store (Entry : entrytype); 
var 

sourcel tr , 
dif ficul tyl tr , 

concretenessl tr , emotionltr, 
vigorltr : char; 
begin 

with entry do begin 

if (source = Latinate) then sourceltr 

else sourceltr 



' 1 ' 

* g* ; 



case difficulty of 

Postgrad : dif f icul tyl tr 
Grad : di f f i cul tyl tr 

High_School : dif f icul tyl tr 
Elementary ; di f f i cu 1 tyl tr 
end; 

if ( concreteness = Tangible) then concretenessl tr := 't' 



'PV 

'g' 

'h'. 

' e ' j 



case emotion of 



else concretenessl tr 



: = ’ • 



Sublime 


emotionltr 


= 


•s'; 


Pleasant 


emotionltr 


= 


•p'; 


Neutral 


emotionltr 




'n'; 


Unpleasant 


emotionltr 


= 


•u'; 


Hor ri d 
1 . 


emotionltr 


= 


'h'; 


1 , 

e vigor of 


Violent 


vigorltr := 


'v 


f . 

, 


Energetic 


vigorltr := 


' e 


f . 


Calm 


vigorltr := 


'c 


t • 

, 


Inert 


vigorltr := 


'i 


• • 

, 



end; 

end; (3^ with entry do X) 
Writeln (Dixonary, 



end; 



Entry. word, sourceltr, dif f icul tyl tr, 
concretenessl tr, emotionltr, vigorl tr) ; 
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APPENDIX D 

THE CODE OF DIXSPLIT AND DIXJOIN 



(X NOTE : THE FOLLONING IS NOT IN EXECUTABLE ORDER. 

IT IS IN THE ORDER OF GREATEST CONCEPTUAL CLARITY. X) 

(x$S60000^) 

program Dixsplit (input, output); 
const 

maxsent = 1000; 
wordlength = 15; 
linolength = 60; 
phrasalength = 
freqnum = 50; 
selnum = 200; 
type 

Genretype = (Nonfiction, Fiction); 

Frequencytype = (Frequently, Seldomly, When_Done); 

Balancetype = (Plus, Zero, Minus); 

Durationtype = (Permanent, Temporary); 

Sourcetype = (Latinate, Germanic); 

Di f f i cul ty type = (PostGrad, Grad, High_School, Elementary); 
Concratenesstype = (Tangible, Intangible); 

Emotiontype = (Sublime, Pleasant, Nuetral, Unpleasant, Horrid); 
Vigortype = (Violent, Energetic, Calm, Inert); 

Hordtype = packed array ( . 1 . . wordl ength . ) of char; 

Linetype = packed array (. 1 .. 1 inel ength . ) of char; 

Phrasetype = packed array (. 1 .. phrasel ength . ) of wordtype; 
EntryPointer = -Entrytype; 

ConcordPoi nter = -'Concordtype; 

Entrytype = record 
Word : Wordtype; 

Balance : Balancetype; 

Duration : Dura ti ontype; 

Source : Sourcetype; 

Difficulty j Di f f i cul tytype; 

Concreteness : Concret enesstype; 

Emotion : Emotiontype; 

Vigor : Vigortype; 

Left, Right ; En t ryPoi nt er ; 

Down : Concordpoi nt er ; 
end; 

Concordtype = record 
Up : EntryPointer ; 

Down , 

Next, Last * ConcordPointer ; 
end; 
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van 

FourRoot / 

Root ; EntryPointer ; 
Lastward •• ConcordPointer; 
Latina t/ 

Germane / 

Tangble, 

I ntngbl 0/ 

Postgrd/ 

Graduate, 

HighSchl, 

El ementa , 

Sublme, 

PI easnt , 

Nuetrl, 

Unpl easn , 

Horrd, 

Violnt , 

Energeti , 

C 1 m , 

Inrt, 

Dixonary : text; 



procedure Attach (Entry : entrytype; var p 
begin 

New (p); 

with p-» do begin 

word := entry. word; 
duration ;= permanent; 
source := entry . source; 
difficulty *. = entry . difficulty; 
concreteness := entry . concreteness ; 
emotion := entry . emotion; 
vigor := entry. vigor; 
left := nil; right := nil; 
down ; = nil; 
end ; 
end; 



: entrypointer ) ; 
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procedure Encode (its : entrytype; var xsource : char; 
var xdifficulty *. char; 

var xconcreteness : char; var xemotion 
var xvigor *. char); 

begin 

if (its. source = Latinate) then xsource := *1’ 

else xsource := *g*; 

case its . difficulty of 



Postgrad 


xdifficulty 


: = 'p' ; 


Grad 


xdifficulty 


: = 'g* ; 


Hi gh_School 


xdifficulty 


:= 'h'; 


Elementary 


xdi f f icul ty 


•.= »e*; 



end; 

if ( its . concreteness = Tangible) then Xconcreteness := 

else Xconcreteness := 

case its. emotion of 



Sublime 


: xemotion 


= ’s’; 


Pleasant 


: xemotion 


= ’P’; 


Nuetra 1 


: xemotion 


- ’n ' ; 


Unpleasant 


: xemotion 


= ’u’; 


Horri d 


: xemotion 


= ’h’; 


i f 

;e its. vigor 


of 




Violent 


: xvigor ; = 


’v’; 


Energetic 


: xvigor := 


’e’; 


Calm 


: xvigor := 


’c’; 


Inert 


: xvigor : = 


'i’i 



end; 

end; 



: char; 



»t» 



procedure Decode (var its : entrytype; 

xsource char; xdifficulty 
xconcreteness : char; xemoti 
xvigor : char); 

begi n 

if (Xsource = *1*) then its. source := Latinate 

else its. source := Germanic; 

case Xdifficulty of 





»pt 


its 


. difficul 


ty 


:= Postgrad; 




*g* 


its 


. difficul 


ty 


: = Gra d; 




»h* 


its 


. di f f i cul 


ty 


:= High_School; 




• e» 


its 


. difficul 


ty 


: = Elementary; 


end 


; 










if 


( Xconcret< 


eness= *t 


’) 


then its . concreteness 












else i ts . concreteness 


case Xemotion of 








*s* 


its 


. emotion 


; = 


Sublime; 




*p* 


its 


. emoti on 


; = 


Pleasant; 




*n* 


its 


. emotion 


• = 


Nuetral ; 




»u* 


i ts 


. emoti on 


; = 


Unpl easant ; 




*h* 


its 


. emotion 


• = 


Horrid; 


end 


; 










case Xvigor ( 


of 








iv» 


its 


.vigor : = 


V: 


i olent ; 




»e* 


its 


.vigor := 


El 


nergeti c ; 




»c» 


its 


.vigor := 


Calm; 




*i » 


its 


. vigor : 




Inert; 



end; 

end; 



: char; 
on : char; 



Tangible 

Intangible; 
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procedure AVL_Insert (Entry : entrytype; var p : Ent rypoint er ; 

van balanced : boolean); 



var 

pi, p2 : entrypointer; 
begin 

if (p = nil) then begin 
Attach (Entry, p); 

P-. balance := Zero; 
balanced := true; 

end 

else if (Entry. word = p^.word) then Writeln (’INSERT COLLISION’) 
else if (Entry. word < p^.word) then begin 
AVL_Insert (Entry, p^.left, balanced); 

If Balanced then (^ left pointer has grown higher X) 
case P-. balance of 

Plus : begin p^. balance := Zero; balanced := false; end; 

Zero : p^. balance ••= Minus; 

Minus : begin (x rebalance ^) 
pi : = p^ . 1 eft ; 

if (pl^. balance = Minus) then begin(^ single LL rotate) 
p-'.left := pi-. right; 
pi-. right ;= p; 
p-. balance := Zero; 
p : = pi ; 
end (^ if ^) 

else begin (^ double LR rotation ^) 
p2 : = pi- . ri ght ; 
pi-. right := p2-.left; 
p2-.left ;= pi; 
p-.left := p2-. right; 
p2-. right := p; 

if (p2-. balance = Minus) then p-. balance := Plus 
else P-. balance := Zero; 

if (p2-. balance = Plus) then pi-. balance := Minus 
else pi-. balance : = Zero; 
p := p2; 
end; (^ else 50 

P-. balance ;= Zero; balanced .* = false; 
end; ( 5( case of Minus 50 
end; ( 5( of cases 

end ( 5( if Entry. word < p-.word 5() 
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else if ( Entry. word > p-'.word) then begin 

AVL_Insert (Entry/ p-. right/ balanced); 
if balanced then (X right pointer has grown higher 
case p^. balance of 

Minus : begin p-. balance := Zero; 

balanced := false; end; 

Zero : p^. balance := Plus; 

Plus ; begin rebalance 
pi := p^. right; 

if (pi-. balance = Plus) then begin 
(5C single RR 
P-. right *. = pi-. left; 
pi-. left j= p; 

P-. balance := Zero; 
p 5 = p 1 ; 
end (X if X) 

else begin (^ double RL rotation 
p2 t = pl- . left; 
pl-.left ; = p2-. right; 
p2-. right := pl; 

P-. right := p2-.left; 
p2- . left : = p; 

if (p2-. balance = Plus) then 
P-. balance := Minus 
else P-. balance := Zero; 
if (p2-. balance = Minus) then 
pl-. balance := Plus 
else pl-. balance := Zero; 

P := p2; 

end; double RL rotation 
P-. balance ;= Zero; balanced := false; 
end; (^ case of balance = Plus ^) 
end; (^ of cases 

end (X of if Entry. word > p-.word ^) 
else balanced := false; 
end; of procedure AVL_Insert 
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procedure Ini*t_Researcher ; 
var 
i / 

stepcount , 
threshhold, 
dixlength : integer; 
s, d, c, e, V : char; 
entry : entrytype; 
balanced ; boolean; 
begin 

Reset (Dixonary/ 'Dixonary text a'); 

New (Root); 

Readln (Dixonary, Root^.word, s, d, c, e, v); 

Decode (Root^, s, d, c, e, v); 

Root-. left := nil; Root-. right := nil; 

Root-. down : = nil; 

Root- . balance := Zero; 

Root- . duration := Permanent; 

Lastward ; = nil; 

Stepcount := 0; 

while not EOF (Dixonary) do begin 

Readln (Dixonary, Entry. word, s, d, c, e, v); 

Decode (Entry, s, d, c, e, v); 

Entry . balance := Zero; balanced := false; 

AVL_Insert (Entry, Root, balanced); 
stepcount := stepcount + 1; 
if (stepcount mod 200 = 0) then begin 

Page; Nriteln; Writeln; Writeln; Writeln; 

Writeln ('Stylist is now loading its dictionary.'); 

Writeln ('Last word loaded was ', ent ry . word : 16 ) ; Writeln; 
Writeln ('Stylist has loaded ', stepcount : 5, ' entries.'); 
Writeln; Writeln; 
end; (^ if 

end; while not EOF ^) 
end; 
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procedure Inorder (var Root : entrypointer ) ; 
begin 

if (Root <> nil) then begin 
Inorder ( Root-* . 1 eft ) ; 
case Root-. Source of 

Latinate : Writeln (Latinat^ Root-.word^ *1*:1); 
Germanic : Writeln (Germane^ Root-. word, *g*:l); 

end; 

case Root- .Concreteness of 

Tangible : Writeln (Tangble, Root-. word, *t*:l); 
Intangible : Writeln (InTngble, Root-. word, ’i*:l); 

end; 

case Root- . difficulty of 

Postgrad : Writeln (Postgrd, Root-. word, *p*:l); 
Grad : Writeln (Graduate, Root-. word, *g’:l); 
High_School : Writeln (HighSchl, Root-. word, *h*:l) 
Elementary ; Writeln (Elementa, Root-. word, *e’;l); 

end; 

case Root- . emoti on of 

Sublime : Writeln (Sublme, Root-. word, *s*;l); 
Pleasant ; Writeln (Pleasnt, Root-. word, *p’:l); 
Nuetral : Writeln (Nuetrl, Root-. word, *n*:l); 
Unpleasant : Writeln (Unpleasn, Root-. word, *u*:l); 
Horrid : Writeln (Horrd, Root-. word, *h*:l); 

end; 

case Root-. vigor of 

Violent : Writeln (Violnt, Root-. word, *v*:l); 
Energetic i Writeln (Energeti, Root-. word, *e*); 
Calm : Writeln (Clm, Root-. word, 'c*); 

Inert : Writeln (Inrt, Root-. word, *i*); 

end; 

Inorder ( Root- . right ) ; 

end; 

end; 
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(Tangble^ 'Tangble text a’); 
(Intngble, *Intngble text a’); 



*Postgrd text a’), 



* Graduate 
’ HighSchl 



text 

text 



begin main program 
Ini t_Researcher ; 

Rewrite (Latinat^ *Latinat text a 
Rewrite (Germane^ ’Germane text a 
Rewrite 
Rewrite 

Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewrite 
Rewri te 
Rewrite 
I norder 
end . 






( Postgrd/ 

(Graduate/ 

( HighSchl / 

(Elementa, ’Elementa text 
(Sublme/ ’Sublme text a 
(Pleasnt/ ’Pleasnt text 
(Nuetrl/ ’Nuetrl text a*)i 
(Unpleasn, ’Unpleasn text 
(Horrd, ’Horrd text a’); 
(Violnt/ 'Violnt text a’)j 
(Energeti, ’Energeti text 
(Clm, ’Clm text a ' ) ; 



a’), 

a’) 

a'). 



a*); 



a’), 



(Inrt, 

( Root) . 



’ Inrt text a * ) ^ 
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(X$S60000X) 

program Dixjoin (inputs output); 
const 

maxsent = 1000; 
wordlength = 15; 
linelength = 60; 
phraselength = 4; 
freqnum = 50; 
selnum = 200; 

type 

Genretype = (Nonfiction^ Fiction); 

Frequencytype = (Frequently, Seldomly, When_Done); 

Balancetype = (Plus^ Zero, Minus); 

Durationtype = (Permanent, Temporary); 

Sourcetype = (Latinate, Germanic); 

Di f f icul tytype = (PostGrad, Grad, High_School, Elementary); 
Concretenesstype = (Tangible, Intangible); 

Emotiontype = (Sublime, Pleasant, Nuetral, Unpleasant, Horrid); 
Vigortype = (Violent, Energetic, Calm, Inert); 



Wordtype = packed array ( . 1 . . wordl ength . ) of char; 

Linetype = packed array (. 1 .. 1 i nel ength . ) of char; 
Phrasetype = packed array (. 1 .. phrasel ength . ) of wordtype; 

EntryPointer = -Entrytype; 

ConcordPointer = ^Concordtype; 

Entrytype = record 
Word : Wordtype; 

Balance : Balancetype; 

Duration : Dura ti ontype; 

Source : Sourcetype; 

Difficulty : Di ffi cul tytype; 

Concreteness : Concretenesstype; 

Emotion : Emotiontype; 

Vigor : Vigortype; 

Left, Right : EntryPointer; 

Down : Concordpointer ; 
end; 

Concordtype = record 
Up : Ent ryPoi nter ; 

Down , 

Next, Last : ConcordPointer ; 
end; 
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FourRoot , 

Root : EntryPointer; 

Lastward : ConcordPointer; 

Latinat/ 

Gernanc , 

Tangbl e, 

Intngbl 

Postgrd/ 

Graduate, 

Hi ghSchl , 

El ementa , 

Subl me , 

P 1 easnt , 

Nuetrl , 

Unpi easn , 

Horrd, 

Vioint , 

Enargeti , 

Clm, 

Inrt , 

Dixonary : text; 
word t wordtype; 

P : entrypointer ; 
c : char; 

procedure Attach (Entry : entrytype; var p 
begin 

New ( p ) ; 

with p- do begin 

word := entry. word; 
duration : = permanent; 
source := entry . source; 
difficulty := en try . di f f i cul ty; 
concreteness := entry . concreteness ; 
emotion := entry . emotion; 
vigor := entry. vigor; 
left := nil; right := nil; 
down := nil; 
end; 
end; 



: entrypointer ) ; 
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procedure Encode (its : entrytype; var xsource : char; 
van xdifficulty : char; 

var xconcreteness : char; var xemotion 
var xvigor : char); 



cha r ; 



begin 
if (its 



source = Latinate) then xsource 

else xsource 

case its. difficulty of 

xdi f f i cul ty 
xdi f f icul ty 
xdifficulty 
xdifficulty 



; 



= *h* 



Postgrad 
Grad 

Hi gh_School 
Elementary 
end; 

if ( i ts . concreteness = Tangible) then Xconcreteness := 't* 



else Xconcreteness 



= • ■ 



case its.emoti 


on of 




Sublime 


: xemotion 


= 's'; 


Pleasant 


: xemotion 


= 'p' ; 


Nuetra 1 


: xemotion 


= *n'; 


Unpleasant 


: xemotion 


= 'u'; 


Ho rri d 


: xemotion 


= 'h'; 


end; 






case its. vigor 


of 




Violent 


: xvigor := 


•v'; 


Energetic 


: xvigor := 


'e'; 


Calm 


: xvigor : = 


•C; 


Inert 


: xvigor := 


'i'; 



end; 

end; 



procedure Decode (var its ; entrytype; 

xsource : char; xdifficulty : char; 
xconcreteness : char; xemotion : char; 
xvigor : char); 

begin 

if (Xsource = *1') then its. source := Latinate 

else its. source := Germanic; 



case Xdifficulty of 

'p' : its . difficulty 

'g' : its. difficulty 

’h’ : its . difficulty 

: i ts . difficulty 



Postgrad; 

Grad; 

High_School ; 
El ementary; 



end; 

if (Xconcreteness= *t*) then i ts . concreteness 

else i ts . concreteness 



case Xemotion of 



*s' 


: its. emotion 


:= Sublime; 


tpt 


: its. emotion 


:= Pleasant; 


'n' 


: its. emotion 


:= Nuetral; 


»u» 


: its. emotion 


:= Unpleasant; 


’h' : its. emotion 

id; 

ise Xvigor of 


:= Horrid; 


»v» 


: its. vigor := 


Violent; 


f e» 


: its. vigor := 


Energetic; 


’c' 


: its. vigor := 


Calm; 


'i' 

id; 


: its. vigor : 


= Inert; 



end; 



Tangible 

Intangible; 
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procedure AVL_Insert (Entry 



entrytype; var p 
var balanced 



Entrypointer ; 
boolean) ; 



var 

pl, p2 : entrypointer; 
begin 

if (p = nil) then begin 
Attach (Entry, p); 

P-. balance := Zero; 
balanced := true; 

end 

else if (Entry. word = p-^.word) then Writeln (’INSERT COLLISION’) 
else if (Entry. word < p^.word) then begin 
AVL_Insert (Entry, p^.left, balanced); 

If Balanced then (^ left pointer has grown higher X) 
case p-. balance of 

Plus : begin p*'. balance := Zero; balanced := false; end; 

Zero : p*'. balance := Minus; 

Minus : begin (^ rebalance X) 
pl := p-'.left; 

if (pl^. balance = Minus) then begin(^ single LL rotate) 
P^.left := pl-'. right; 
pl^. right := p; 

P". balance *. = Zero; 

P := pl; 
end (^ if ^) 

else begin (X double LR rotation ^) 
p2 : = pl-' . right; 
pl-. right := p2-.left; 
p2-'.left := Pl; 

P-.left := p2-'. right; 



p2-'. right := p; 

if (p2-. balance = Minus) then p-'. balance := Plus 
else p-'. balance := Zero; 

if ( p2-' . ba la nee = Plus) then pl-. balance ;= Minus 
else pl-. balance := Zero; 

P •• = p2; 
end; (^ else ^) 

P-. balance ;= Zero; balanced false; 
end; (^ case of Minus ^) 
end; (^ of cases ^) 
end (^ if Entry. word < p-.word ^) 
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else if ( Entry. word > p-.word) then begin 

AVL_Insert (Entry> p-. rights balanced); 
if balanced then ( right pointer has grown higher 
case P-. balance of 

Minus : begin p^. balance ;= Zero; 

balanced := false; end; 

Zero : p^. balance := Plus; 

Plus : begin rebalance 
pi := P-. right; 

if ( pi-' . balance = Plus) then begin 
single RR 3() 

P-. right := pi-. left; 
pi-. left : = p; 

P-. balance := Zero; 

P ;= pl; 
end (X if x) 

else begin (X double RL rotation x) 
p2 := pl-.left; 
pl-.left := p2-. right; 
p2-. right := pl; 

P-. right := p2-.left; 
p2-.left := p; 

if (p2-. balance = Plus) then 
P-. balance := Minus 
else P-. balance := Zero; 
if (p2-. balance = Minus) then 
pl-. balance *. = Plus 
else pl-. balance := Zero; 

P := p2; 

end; (x double RL rotation 3() 

P-. balance := Zero; balanced := false; 
end; (x case of balance = Plus 
end; of cases X) 
end of if Entry. word > p-.word 50 

else balanced := false; 
end; ( 5( of procedure AVL_Insert 50 



103 



procedure I ni t_Researcher ; 
var 
i ^ 

stepcount/ 
threshhol d/ 
dixlength : integer; 
s, d, c, Q, V ; char; 
entry : entrytype; 
balanced : boolean; 
begin 

Reset (Dixonary, • Dixonary text a*); 

New (Root); 

Readln (Dixonary, Root-. word, s, d, c, e, v); 

Decode (Root-, s, d, c, e, v); 

Root-. left := nil; Root-. right := nil; 

Root-. down := nil; 

Root- . ba lance := Zero; 

Root- , durati on .*= Permanent; 

Lastward := nil; 

Stepcount := 0; 

while not EOF (Dixonary) do begin 

Readln (Dixonary, Entry. word, s, d, c, e, v); 

Decode (Entry, s, d, c, e, v); 

Entry . balance := Zero; balanced := false; 

AVL_Insert (Entry, Root, balanced); 
stepcount : = stepcount + 1; 
if (stepcount mod 200 = 0) then begin 

Page; Writeln; Writeln; Writeln; Writeln; 

Writeln (’Stylist is now loading its dictionary.’); 

Writeln (’Last word loaded was ’, entry . word : 16 ) ; Writeln; 
Writeln (’Stylist has loaded stepcount : 5 , ’ entries.’); 
Writeln; Writeln; 
end; ( if ^) 
end; (^ while not EOF ^) 
end; 



procedure Inorder (var Root : ent rypointer ; 

word : wordtype; 
var P ; entrypointer ) ; 



begi n 

if (Root-. word = word) then p := root 

else if (root-. word > word) then Inorder (Root-. left, word, p) 
else Inorder ( Root- . ri ght , word, p); 
end; 



procedure Store_BST (p ; entrypoi nter ) ; 
var s, d, c, e, v : char; 
begi n 

if ( p <> nil) then begin 

Encode (p-, s, d, c, e, v); 

Writeln (Dixonary, p-.word, s, d, c, e, v); 
Store^BST (p-.left); 

Store_BST (p-. right); 
end ; 
end; 

procedure Store_New_Dix; 
begin 

Rewrite (Dixonary, ’Dixonary text a’); 
Store^BST (Root); 
end; 



lOA 



begin main program X) 



Init_Researcher; 

Reset (Latinat/ *Latinat text a’); 

While not EOF (Latinat) do begin 
Readln (Latinat/ word/ c); 

if ( c <> *1*) then begin Inorder (root/ word/ p); 
P-. source := germanic; end; 
end; 

Reset (Germane/ ’Germane text a’); 

While not EOF (Germane) do begin 
Readln (Germane/ word/ c); 

if ( c <> ’g’) then begin Inorder (root/ word/ p); 
P-. source := latinate; end; 
end; 

Reset (Tangble/ ’Tangble text a’); 

While not EOF (Tangble) do begin 
Readln (Tangble/ word/ c); 

if ( c <> ’t’) then begin Inorder (root/ word/ p); 
P- . concreteness := intangible; end; 
end; 

Reset (Intngble/ ’Intngble text a’); 

While not EOF (InTngble) do begin 
Readln (Intngble/ word/ c); 

if ( c <> *i’) then begin Inorder (root/ word/ p); 
p- . concreteness := tangible; end; 
end; 

Reset (Postgrd/ ’Postgrd text a’); 

While not EOF (Postgrd) do begin 
Readln (Postgrd/ word/ c); 

if ( c <> *p*) then begin Inorder (root/ word/ p); 
if (c = ’g’) then p^ . difficulty := grad; 

if (c = *h’) then p- . difficulty := high_school; 

if (c = *e’) then p- . di f f i cul ty := elementary; end; 

end / 

Reset (Graduate/ ’Graduate text a’); 

While not EOF (Graduate) do begin 
Readln (Graduate/ word/ c); 

if ( c <> ’g’) then begin Inorder (root/ word/ p); 
if (c = ’p’) then p^ . di f f icul ty ;= postgrad; 

if (c = ’h’) then p-* . difficulty := high_school; 

if (c = ’e’) then p- . difficulty := elementary; end; 

end; 

Reset (HighSchl/ ’HighSchl text a’); 

While not EOF (HighSchl) do begin 
Readln (HighSchl/ word/ c); 



if 


( c 


<> ’h 


’) then begin Inorder 


(root/ word/ 


p); 


if 


(c 


= »p’) 


then 


p- . difficulty := 


postgrad; 




if 


( c 


= »g’) 


then 


p- . dif f icul ty := 


grad; 




if 


( c 


= ‘e’) 


then 


p- . dif f icul ty := 


elementary; 


end; 



end; 

Reset (Elementa/ ’Elementa text a’); 

While not EOF (Elementa) do begin 
Readln (Elementa/ word/ c); 

if ( c <> ’e’) then begin Inorder (root/ word/ p); 
if (c = ’p’) then p-^ . difficulty := postgrad; 

if (c = ’g’) then p- . dif f icul ty := grad; 

if (c = ’h’) then p-* . difficulty := high_school; end; 

end; 
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Reset (Sublme/ *Sublme text a*); 

While not EOF (Sublme) do begin 
Readln (Sublme/ word/ c); 

if ( c <> *s*) then begin Inorder (root/ word/ 



if (c = ’p*) then p-. emotion 
if (c = *n*) then p-. emotion 
if (c = *u*) then p-. emotion 
if (c = ’h’) then p-. emotion 



= pleasant; 

= nuetral; 

= unpleasant; 

= horrid; end; 



p) 



end; 



Reset (Pleasnt/ 'Pleasnt text a’); 

While not EOF (Pleasnt) do begin 
Readln (Pleasnt, word, c); 

if ( c <> *p’) then begin Inorder (root, word, p) 



if (c = *s*) then p^. emotion 
if (c = *n*) then p-. emotion 
if (c = *u*) then p^. emotion 
*h*) then p-. emotion 



if ( c = 
end; 



= sublime; 

= nuetral; 

:= unpleasant; 

!= horrid; end; 



Reset (Nuetrl/ 'Nuetrl text a*); 

While not EOF (Nuetrl) do begin 
Readln (Nuetrl, word, c); 

if ( c <> ’n’) then begin Inorder (root, word, p) 
if (c = *s*) then p-. emotion := sublime; 

if (c = *p*) then p^. emotion := pleasant; 

if (c = *u’) then p-. emotion := unpleasant; 

if (c = *h*) then p^. emotion ;= horrid; end; 

end; 



Reset (Unpleasn, ’Unpleasn text a*); 

While not EOF (Unpleasn) do begin 
Readln (Unpleasn, word, c); 

if ( c <> *u*) then begin Inorder (root, word, p) 



if (c = *s*) then p-. emotion 
if (c = *p*) then p-. emotion 
if (c = *n*) then p-. emotion 
if (c = *h*) then p^. emotion 
end; 



= sublime; 

= pleasant; 

= nuetral; 

= horrid; end; 



Reset (Horrd, *Horrd text a’); 

While not EOF (Horrd) do begin 
Readln (Horrd, word, c); 
if ( c <> *h*) then begin Inorder (root, word, p) 
if ( c 
if ( c 
if (c 
if (c 
end; 



’s*) then P-. emotion 
*p’) then p-*. emotion 
, emotion 
, emotion 



’ n ’ ) then 
’ u * ) then 



sublime; 
pleasant; 
nuetral ; 

unpleasant; end; 



Reset (Violnt, 'Violnt text a*); 

While not EOF (Violnt) do begin 
Readln (Violnt, word, c); 

if ( c <> *v’) then begin Inorder (root, word, p) 
if (c = *e’) then p-. vigor := energetic; 

if (c = *c’) then p-. vigor := calm; 

if (c = *i’) then p-. vigor := inert; end; 

end; 



Reset (Energeti, ’Energeti text a*); 

While not EOF (Energeti) do begin 
Readln (Energeti, word, c); 

if ( c <> *e’) then begin Inorder (root, word, p) 
if (c = *v’) then p-. vigor := violent; 

if (c = ’c*) then p-. vigor := calm; 

if (c = *i*) then p-. vigor := inert; end; 

end; 



Reset (Clm, *Clm text a*); 

While not EOF (Clm) do begin 
Readln iClm, word, c); 

if ( c <> *c*) then begin Inorder (root, word, p) 
if (c = *v*) then p--. vigor ;= violent; 

if (c = *e*) then p-. vigor := energetic; 

if (c = *i*) then p-'. vigor := inert; end; 

end; 

Reset (Inrt, *Inrt text a*); 

While not EOF (Inrt) do begin 
Readln (Inrt, word, c); 

if ( c <> *i') then begin Inorder (root, word, p) 
if (c = *v*) then p-. vigor := violent; 

if (c = *e*) then p-. vigor := energetic; 

if (c = ’c*) then p-. vigor := calm; end; 

end; 

Store_New_Dix; 



end . 
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APPENDIX E 
REPRESENTATIVE RUNS 



PROFILE 



Plato's Phaedrus 



Total of 
Total of 
Total of 



sentences 

words 

letters 



17 

360 

1595 



360 words are too few for valid statistical analysis. 



Of the 360 words in Plato's Phaedrus 

312 were patched to words in the Stylist dictionary. 

86 X were matched. 

This is enough for statistical analysis. 
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BREAKDOWN OF PERCENT OF LETTERS PER WORD 



30 

28 

26 

24 

22 

20 

18 

16 

14 

12 

10 

8 

6 

4 

2 



The average 



2 



number 



X 

XXX 

XXX 

X X X X 

X X X X 

X X X X 

X X X X X 

X X X X X 

X X X X X 

3 4 5 6 7 

Number of Letters in a 

of letters per word 



X 

X 

X 

X X 

XXX 

XXX. 

8 9 10 11 

Word 

: 6 .131428571E+00 



12 



13 



A typical nonfiction texts distribution 

resembles a low bell-shaped curve centered around six letters/word . 
The length of the words is medium 



14 15 
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BREAKDOWN OF NUMBER OF WORDS PER SENTENCE 



1 



Number of words 
100 
98 
96 

9^4 

92 

90 

88 

36 

8^ 

82 

80 

78 

76 

10 , 

72 
70 
68 
6 6 



6^ 

62 

60 

58 

56 

5 ^ 

52 

50 

^8 

OS 

00 

02 

^40 X 

38 X 

36 X 

34 X 

32 X XX 

30 X XX 

28 X XX X 

26 X XX X 

24 XXX XX X 

22 X XXXX XX X 

20 X XXXX XX X 

13 X XXXXX XXX X 

16 XXXXXXXX XXX X 

14 XXXXXXXX XXX X 

12 XXXXXXXXXXXXXXX X 

1 0 xxxxxxxxxxxxxxxxx 

3 xxxxxxxxxxxxxxxxx 
6 xxxxxxxxxxxxxxxxx 

4 xxxxxxxxxxxxxxxxx 
2 xxxxxxxxxxxxxxxxx 

10 20 



30 40 50 

Sentence Number 



60 



70 



The average number of words per sentence : 2 . 1 176 47 059E+0 1 

A typical modern texts sentences average between fifteen and twenty words 
Sentences are long. 



Number of run ons 0 
Percent of run ons 0 
Run ons are nonexistent 
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Number of medium length sentences 9 

Percent of medium length sentences 52 

Modulation is good. 

ETYMOLOGY OF WORDS 

Number of Latinate words : 51 

Number of Germanic words : 95 

Etymology is very native. 

DIFFICULTY OF VOCABULARY 

PostGraduate difficulty : 0 

Graduate difficulty : 1 

High School difficulty ; 25 

Elementary difficulty : 120 

Percent of Postgraduate difficulty 0 

Percent of Graduate difficulty 0 

Percent of High School difficulty 8 

Difficulty is easy. 

TANGIBILITY 

Number of Tangible words : 35 

Number of Intangible words : 111 

Tangibility is tangibible. 

EMOTIONAL CONNOTATIONS 

Sublime connotations : 7 

Pleasant connotations : AA 

Neutral connotations : 87 

Unpleasant connotations : 8 

Horrid connotations : 0 

Percent of sublime connotations 2 

Percent of pleasant connotations 14 

Percent of unpleasant connotations 2 

Percent of horrid connotations 0 

Index of Emotionality 42 

Emotionality is Rich 

Tone is Positive 
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Ol 0- 



VIGOR OF WORDS 



Words 


of 


Extreme Vigor 


0 


Wo rds 


of 


Much Vigor 


17 


Words 


of 


Some Vigor : 


75 


Words 


of 


Little Vigor : 


54 



ercent of words of extreme vigor 
ercent of words of much vigor 
Percent of words of some vigor 



Index of Vigor 
Vigor is lively 



49 



0 

5 

24 



Congratulations! You 
This is a solid piece 



never seem to write run on 
of writing well within the 



sentences . 
tradi ti ons 



of 



its 



genre . 
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Phaedrus 



CONCORDANCE 



them and thamus enquired ABOUT their several uses and 

discovered is not an AID to memory but to 

other egyptians might be ALLOWED to have the benefit 

an art is not ALWAYS the best judge of 

is called by them AMMON to him came theuth 

learned nothing they will APPEAR to be omniscient and 

censured others as he APPROVED or disapproved of them 

many arts such as ARITHMETIC and calculation and geometry 

or inventor of an ART is not always the 

such as arithmetic and 
but when they came 
ASTRONOMY and draughts and dice 
to them a quality 
BECAUSE they will not use 
BEEN led to attribute to 



the inventor of many ARTS 

blame of the various ARTS 

calculation and geometry and 
have been led to ATTRIBUTE 
in the learners* souls 
your own children have 



allowed to have the 
is not always the 
wiser and give them 
name was theuth the 
theuth in praise or 
it is a specific 
such as arithmetic and 
egypt which the Hellenes 



the bird which is 
the god himself is 
them ammon to him 
arts but when they 
a quality which they 
some of them and 



BENEFIT of them he enumerated 
BEST judge of the utility 
BETTER memories it is a 
BIRD which is called the 
BLAME of the various arts 
BOTH for the memory and 

CALCULATION and geometry and astronomy 
CALL egyptian thebes and the 



CALLED the ibis is sacred 
CALLED by them ammon to 
CAME theuth and showed his 
CAME to letters this said 

CANNOT have for this discovery 
CENSURED others as he approved 



to the external written CHARACTERS and not remember of 
love of your own CHILDREN have been led to 
socrates at the egyptian CITY of naucratis there was 
dwelt in that great CITY of upper egypt which 
they will be tiresome COMPANY having the show of 
king of the whole COUNTRY of egypt and he 

discovery of yours will CREATE forgetfulness in the learners’ 

letters now is those DAYS the god thamus was 

and showed his inventions DESIRING that the other egyptians 

astronomy and draughts and DICE but his great discovery 

as he approved or DISAPPROVED of them it would 

and you give your DISCIPLES not truth but only 

specific which you have DISCOVERED is not an aid 

dice but his great DISCOVERY was the use of 

cannot have for this DISCOVERY of yours will create 

geometry and astronomy and DRAUGHTS and dice but his 

of egypt and he DWELT in that great city 

and he dwelt in 
which the Hellenes call 
city of naucratis there 
EGYPTIAN thebes and the god 
EGYPTIANS 



the whole country of EGYPT 
great city of upper EGYPT 
socrates at the EGYPTIAN 
which the Hellenes call 
desiring that the other 



might be allowed to 



theuth will make the EGYPTIANS wiser and give them 
enumerated them and thamus ENQUIRED about their several 
benefit of them he ENUMERATED them and thamus enquired 
will trust to the EXTERNAL written characters and not 
naucratis there was a FAMOUS old god whose name 
you who are the FATHER of letters from the 
of yours will create FORGETFULNESS in the learners’ souls 

be omniscient and will GENERALLY know nothing they will 



uses 



arithmetic and calculation and 
the egyptians wiser and GIVE 

to reminiscence and you GIVE 



GEOMETRY and astronomy and draughts 
them better memories it 
your disciples not truth 
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was a famous old GOD whose name was theuth 

is those days the GOD thamus was the king 

egyptian thebes and the GOD himself is called by 

and dice but his GREAT discovery was the use 

he dwelt in that' GREAT city of upper egypt 

will be tiresome company HAVING the show of wisdom 

truth they will be HEARERS of many things and 

upper egypt which the HELLENES call egyptian thebes and 

thebes and the god HIMSELF is called by them 

which is called the IBIS is sacred to him 

thamus replied o most INGENIOUS theuth the parent or 

them and in this INSTANCE you who are the 

of the utility or INUTILITY of his own inventions 

theuth and showed his INVENTIONS desiring that the other 

inutility of his own INVENTIONS to the users of 

and he was the INVENTOR of many arts such 

theuth the parent or INVENTOR of an art is 

not always the best JUDGE of the utility or 

god thamus was the KING of the whole country 

omniscient and will generally KNOW nothing they will be 

things and will have LEARNED nothing they will appear 

create f orgetf ulness in the LEARNERS* souls because they will 

own children have been LED to attribute to them 

was the use of LETTERS now is those days 

when they came to LETTERS this said theuth will 

are the father of LETTERS from the paternal love 

it would take a LONG time to repeat all 

letters from the paternal LOVE of your own children 

this said theuth will MAKE the egyptians wiser and 

was the inventor of MANY arts such as arithmetic 

will be hearers of MANY things and will have 

and give them better MEMORIES it is a specific 

will not use their MEMORIES they will trust to 

specific both for the MEMORY and for the wit 

not an aid to MEMORY but to reminiscence and 

that the other egyptians MIGHT be allowed to have 



wit thamus replied o 
famous old god whose 
the egyptian city of 
and will have learned 
and will generally know 
the wit thamus replied 
there was a famous 
will appear to be 



MOST 
NAME 

NAUCRATIS 
NOTHING 
NOTHING 
0 most 
OLD god 
OMNISCIENT 



ingenious theuth the parent 
was theuth the bird 

there was a famous 
they will appear to 

they will be tiresome 
ingenious theuth the 
whose name was 

and will generally know 



inventions desiring that the OTHER egyptians might be allowed 

of then and censured OTHERS as he approved or 

or inutility of his OWN inventions to the users 

paternal love of your OWN children have been led 

most ingenious theuth the PARENT or inventor of an 

of letters from the PATERNAL love of your own 

said to theuth in PRAISE or blame of the 

their several uses and PRAISED some of them and 

attribute to them a QUALITY which they cannot have 

of wisdom without the REALITY 

written characters and not REMEMBER of themselves the specific 
to memory but to REMINISCENCE and you give your 
a long time to REPEAT all that thamus said 
for the wit thamus REPLIED 

called the ibis is SACRED 

repeat all that thamus SAID 
came to letters this SAID 
truth but only the SEMBLANCE 



thamus enquired about their 
tiresome company having the 
him came theuth and SHOWED 
SOCRATES at the egyptian 



o most ingenious 
to him and he 

to theuth in praise 
theuth will make the 
of truth they will 



theuth 



SEVERAL uses and praised some 
SHOW of wisdom without the 
his inventions desiring that 
city 
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several uses and praised SOME of them and censured 
f orgetf ulness in the learners' SOULS because they will not 
memories it is a SPECIFIC both for the memory 

remember of themselves the SPECIFIC which you have discovered 



of them it would 
those days the' god 



and thamus enquired about 
they will not use THEIR 



TAKE a long time to 

THAMUS was the king of 
he enumerated them and THAMUS enquired about their several 
to repeat all that THAMUS said to theuth in 
and for the wit THAMUS replied o most ingenious 
the hellenes call egyptian THEBES and the god himself 

THEIR several uses and praised 
memories they will trust 
and not remember of THEMSELVES the specific which you 
egyptian city of naucratis THERE was a famous old 
god whose name was THEUTH the bird which is 
anmon to him came THEUTH and showed his inventions 
that thamus said to THEUTH in praise or blame 
to letters this said THEUTH will make the egyptians 
replied o most ingenious THEUTH the parent or inventor 
be hearers of many THINGS and will have learned 
of letters now is THOSE days the god thamus 

would take a long TIME to repeat all that 

nothing they will be TIRESOME company having the show 
their memories they will TRUST to the external written 
give your disciples not TRUTH but only the semblance 
only the semblance of TRUTH they will be hearers 
that great city of UPPER egypt which the hellenes 
great discovery was the USE of letters now is 
because they will not USE their memories they will 

of them and in 
USES and praised some of 
or inutility of his 
arts but when they 

they came to letters 
is called the ibis 
the hellenes call egyptian 
they cannot have for 
WHICH you have discovered is 
country of egypt and 
name was theuth the 
having the show of WISDOM without the reality 
will make the egyptians WISER and give them better 
memory and for the WIT thamus replied o most 
the show of wisdom WITHOUT the reality 
disapproved of them it WOULD take a long time 
trust to the external WRITTEN characters and not remember 

for this discovery of YOURS will create f orgetf ulness in 



own inventions to the USERS 
enquired about their several 
best judge of the UTILITY 
or blame of the VARIOUS 

the various arts but WHEN 
was theuth the bird WHICH 
city cf upper egypt WHICH 
to them a quality WHICH 
of themselves the specific 
the king of the WHOLE 
a famous old god WHOSE 
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student Paper. Example of Poor Writing 

PROFILE 



Total of sentences 
Total of words 
Total of letters 



62 

2026 

10561 



2026 words are plenty for valid statistical analysis. 



Of the 2026 words in Student Paper. Example of Poor Writing , 
1839 were matched to words in the Stylist dictionary. 

90 X were matched. 

This is enough for statistical analysis. 
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BREAKDOWN OF PERCENT OF LETTERS PER WORD 



30 

28 

26 

2A 

22 

20 

18 

16 

14 

12 

10 

8 

6 

4 

2 



1 



2 



X 

X 

X 

X X 

X X 

X X 

XXX 

XXX 

3 4 5 

Number of Lett 



XXX 

XXX 

XXX 

XXX 

XXX 

XXX 

6 7 8 

ers in a Word 



X 

X 

X 

9 



X 

X X 

X X 

10 11 12 



X 

13 14 



15 



The average number of letters per word : 6 . 9218877 13E+00 

A typical nonfiction texts distribution 

resembles a low bell-shaped curve centered around six letters/word . 
The length of the words is long. 
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1 BREAKDOWN OF NUMBER OF WORDS PER SENTENCE 



Number of 


words 
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58 


XX 
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56 


XX 
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54 


XXX 
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X 
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52 


XXX 




X 
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XX 


50 


XXX 




X 
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XX 
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XX 


48 


XXX 




X 






XX 




XX 
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XX 


46 


xxxx 




X 






XX X 




XX 


X 


XX 


44 


xxxx 




X 


X 




XX X 




XX 


X 


X XX 


42 


xxxx 




X 


X 




XX X 




XXX 


X X 


X XX 


40 


xxxx 




X 


X 




XX X 




XXX 


X XX 


X XX 


33 


xxxx 




X 


X 




XX X X 




XXX 


X XX 


XX XX 


36 


xxxx 




X 


X 




XX X X 


X X 


XXX 


X XX 


XX XX 


34 


xxxx 




X 


X XX 




XX X X 


X X 


XXX 


X XX 


XX XX 


32 


xxxx 




X 


X XX 




XX X X 


X X 


XXX 


X xxxx 


XX XXX 


30 


xxxx 




X 


xxxxx 




xxxx X 


X X 


XXX X 


X xxxx 


XX XXX 


28 


xxxx 




X 


xxxxx 




xxxxx X 


X XX 


XXX X 


X xxxx 


XX XXX 


26 


xxxx 


X 


X 


xxxxx 




xxxxx X 


XX XX 


XXX X 


X xxxx 


XXXXXXXX 


24 


xxxx 


X 


X 


xxxxx 




xxxxxxx 


XX XX 


xxxxx 


xxxxxx 


XXXXXXXX 


22 


xxxx 


XX 


XX 


xxxxx 


X 


xxxxxxx 


XX XX 


xxxxx 


xxxxxx 


XXXXXXXX 


20 


xxxx 


XX 


XX 


xxxxx 


X 


xxxxxxx 


xxxxx 


xxxxx 


xxxxxx X 


XXXXXXXX 


18 


xxxx 


XX 


XX 


xxxxx 


XX 


XXXXXXXX 


xxxxx 


xxxxx 


XXXXXXXX 


xxxxxxxxx 


16 


xxxx 


XX 


XX 


xxxxx 


XX 


xxxxxxxxxxxxxx 


xxxxx 


XXXXXXXX 


xxxxxxxxx 


14 


xxxx 


XX 


XX 


xxxxx 


XXX 


xxxxxxxxxxxxxx 


xxxxxxxxxxxxxxx 


xxxxxxxxx 


12 


xxxx 


XX 


XX 


xxxxx 


XXX 


xxxxxxxxxxxxxx 


xxxxxxxxxxxxxxxxxxxxxxxxx 


10 


xxxx 


XXX 


XX 


xxxxx 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 



3 XXXXXXXX XX xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
6 XXXXXXXX XX xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
^ xxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
2 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

10 20 30 40 50 60 70 

Sentence Number 

The average number of words per sentence : 3 . 267741935E+01 



A typical modern texts sentences average between fifteen and twenty words. 

Sentences are too long. 

Number of run ons 13 
Percent of run ons 20 



Run ons are unacceptable 
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Number of medium length sentences 14 
Percent of medium length sentences 22 

Modulation is average 

ETYMOLOGY OF WORDS 

Number of Latinate words : 708 

Number of Germanic words : 402 

Etymology is mixed 

DIFFICULTY OF VOCABULARY 

PostGraduate difficulty : 26 

Graduate difficulty : 141 

High School difficulty : 405 

Elementary difficulty : 538 

Percent of Postgraduate difficulty 1 

Percent of Graduate difficulty 7 

Percent of High School difficulty 22 

Difficulty is very hard 

TANGIBILITY 

Number of Tangible words : 343 

Number of Intangible words : 767 

Tangibility is tangibible. 

EMOTIONAL CONNOTATIONS 

Sublime connotations : 7 

Pleasant connotations : 166 

Neutral connotations : 861 

Unpleasant connotations : 75 

Horrid connotations : 1 

Percent of sublime connotations 0 

Percent of pleasant connotations 9 

Percent of unpleasant connotations 4 

Percent of horrid connotations 0 

Index of Emotionality 26 

Emotionality is Rich 

Tone is Positive 
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VIGOR OF WORDS 



Words 


of 


Extreme Vigor : 


1 


Words 


of 


Much Vigor : 


9A 


Words 


of 


Some Vigor : 




Words 


of 


Little Vigor : 


551 



Percent 


of 


words 


of 


extreme vigor : 


Percent 


of 


words 


of 


much 


vigor : 


Percent 


of 


words 


of 


some 


vigor : 



Index of Vigor 50 

Vigor is lively 



0 

5 

25 



RECOMMENDATION NUMBER 1 



You tend to write run-on sentences. 

Check your longest sentences for run ons. 
Break them up into units of single ideas. 



RECOMMENDATION NUMBER 2 



Your average sentences are too long for the difficulty of your vocabulary. 
Use simpler words or shorter sentences. 
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Excerpt from A Farewell to Arms 



PROFILE 



Total of sentences 
Total of words 
Total of letters 



65 

1349 

5431 



1349 words are 



plenty for valid statistical analysis. 



Of the 1349 words in Excerpt from A Farewell to Arms 
1197 were matched to words in the Stylist dictionary. 

88 % were matched. 

This is enough for statistical analysis. 
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BREAKDOWN OF PERCENT OF LETTERS PER WORD 
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X 
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X 
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X 
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X 

X 

X 

X 

X 

X 

X 

X 



X 
X 
X 

3 4 5 6 7 8 
Number of Letters in a Word 



X 

X 

X 

X 

X 

X 

X 



X 

X 

X 

X 



X 

9 



10 



11 



12 



13 



14 



15 



The average number of letters per word 



5.100817439E+00 



A typical fiction texts distribution 
resembles a tall bell-shaped curve centered around five 1 etters/word . 



The length of the words is medium 
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BREAKDOWN OF NUMBER OF WORDS PER SENTENCE 



Number 

100 

98 

96 

94 

92 

90 

88 

86 

84 

82 

80 

78 

76 

74 

72 

70 
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64 
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58 

56 

54 

52 

50 

48 

46 

44 

42 

40 
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of words 



X 
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X 
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X 



8 XXXX XXXXXXXXXXXXXXXXXX XXXXXXX XXXXXXXXXXXX XXX xxxxxxxxxxxxxxxx 

6 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxx 
4 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
2 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 



10 



20 



30 

Sentence 



40 

Number 



50 



60 



70 



The average number of words per sentence 



2.075384615E+01 



A typical modern texts sentences average between fifteen and twenty words. 



Sentences are long. 



Number of run ons 3 

Percent of run ons 4 



Run ons are acceptable. 
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Number of medium length sentences 26 
Percent of medium length sentences 40 

Modulation is average 

ETYMOLOGY OF WORDS 

Number of Latinate words : 157 

Number of Germanic words : 484 

Etymology is very native. 

DIFFICULTY OF VOCABULARY 

PostGraduate difficulty : 0 

Graduate difficulty : 0 

High School difficulty 25 

Elementary difficulty : 596 

Percent of Postgraduate difficulty 0 

Percent of Graduate difficulty 0 

Percent of High School difficulty 2 

Difficulty is easy. 

TANGIBILITY 

Number of Tangible words : 179 

Number of Intangible words : 442 

Tangibility is tangibible. 

EMOTIONAL CONNOTATIONS 

Sublime connotations : 1 

Pleasant connotations : 135 

Neutral connotations : 450 

Unpleasant connotations : 34 

Horrid connotations : 1 

Percent of sublime connotations 0 

Percent of pleasant connotations 11 

Percent of unpleasant connotations 2 

Percent of horrid connotations 0 

Index of Emotionality 26 

Emotionality is Rich 

Tone is Positive 
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VIGOR OF WORDS 



Words 


of 


Extreme Vigor : 


4 


Words 


of 


Much Vigor : 


73 


Words 


of 


Some Vigor : 


324 


Words 


of 


Little Vigor : 


220 



Percent 


of 


words 


of 


extreme vigor : 


0 


Percent 


of 


words 


of 


much vigor : 


6 


Percent 


of 


words 


of 


some vigor : 


27 



Index of Vigor 57 

Vigor is strong 



This is a solid piece of writing well within the traditions of its genre. 

You are as able to understand the meaning of the above characteristics as The St 
ylist . 

Stand the course! 
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Computer Science Text 1 



PROFILE 



Total of sentences 
Total of words 
Total of letters 



56 

1178 

5868 



1178 words are plenty for valid statistical analysis. 



Of the 1178 words in Computer Science Text 1 

1026 were matched to words in the Stylist dictionary. 

87 r< were matched. 

This is enough for statistical analysis. 
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BREAKDOWN OF PERCENT OF LETTERS PER WORD 
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X 



3456789 10 11 

Number of Letters in a Word 



12 13 14 15 



The average number of letters per word : 6 . 500000000E+00 

A typical nonfiction texts distribution 

resembles a low bell-shaped curve centered around six 1 etters/word . 
The length of the words is medium 
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1 BREAKDOWN OF NUMBER OF WORDS PER SENTENCE 

Number of words 
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4 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
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10 20 30 40 50 



60 



Sentence Number 



70 



The average number of words per sentence 



2.105571428E+01 



A typical modern texts sentences average between fifteen and twenty words. 
Sentences are long. 



Number of run ons 1 

Percent of run ons 1 



Run ons are acceptable. 
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Number of medium length sentences 28 
Percent of medium length sentences 50 

Modulation is average 

ETYMOLOGY OF WORDS 

Number of Latinate words : ^22 

Number of Germanic words s 227 

Etymology is mixed 

DIFFICULTY OF VOCABULARY 

PostGraduate difficulty : 1 

Graduate difficulty : 55 

High School difficulty ; 273 

Elementary difficulty : 320 

Percent of Postgraduate difficulty 0 

Percent of Graduate difficulty 5 

Percent of High School difficulty 26 

Difficulty is challenging 

TANGIBILITY 

Number of Tangible words : 106 

Number of Intangible words : 5A3 

Tangibility is very intangibible . 

EMOTIONAL CONNOTATIONS 

Sublime connotations *. 1 

Pleasant connotations : 50 

Neutral connotations : 562 

Unpleasant connotations : 32 

Horrid connotations : A 

Percent of sublime connotations 0 

Percent of pleasant connotations 4 

Percent of unpleasant connotations 3 

Percent of horrid connotations 0 

Index of Emotionality 14 

Emotionality is average 

Tone is Positive 
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VIGOR OF WORDS 
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Index of Vigor 
Vigor is lively 



^4 



0 

3 

29 



This is a solid piece of writing well within the traditions of its genre. 

You are as able to understand the meaning of the above charac teristi cs as The St 
yl ist . 

Stand the course! 
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Excerpt from Galapagos by Vonnegut 

PROFILE 



Total of sentences : 22 
Total of words : 542 
Total of letters : 2526 



542 words are enough for valid statistical analysis. 



Of the 542 words in Excerpt from Galapagos by Vonnegut 
449 were matched to words in the Stylist dictionary. 

82 y. were matched. 

This is enough for statistical analysis. 
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BREAKDOWN OF PERCENT OF LETTERS PER WORD 
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XXX. 
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X X X X X 

X X X X X 

X X X X X 

3 A 5 6 7 

Number of Letters in a 



X 

X 

8 

Word 



X 

X 

X 

X 

9 



X 

10 11 12 13 lA 15 



The average number of letters per word 



5.760233918E+00 



A typical fiction texts distribution 
resembles a tall bell-shaped curve centered around five 1 etters/word . 



The length of the words is long. 
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1 BREAKDOWN OF NUMBER OF WORDS PER SENTENCE 

Number of words 
100 
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14 XXX XXXX XXXXXXXXXXX X 
12 XXX XXXX XXXXXXXXXXX X 
10 XXX XXXX xxxxxxxxxxxxx 
8 xxxxxxxx xxxxxxxxxxxxx 
6 xxxxxxxxxxxxxxxxxxxxxx 
4 xxxxxxxxxxxxxxxxxxxxxx 
2 xxxxxxxxxxxxxxxxxxxxxx 

10 20 30 40 50 60 70 

Sentence Number 

The average number of words per sentence : 2 . 463636364E+01 

A typical modern texts sentences average between fifteen and twenty words. 



Sentences are too long. 

Number of run ons 2 

Percent of run ons 9 

Run ons are unacceptable 
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Number of medium length sentences 
Percent of medium length sentences 

Modulation is average 



10 

45 



ETYMOLOGY OF WORDS 



Number of Latinate words 
Number of Germanic words 



Etymology is very native. 



86 

184 



DIFFICULTY OF VOCABULARY 



PostGraduate difficulty : 0 
Graduate difficulty : 3 
High School difficulty : 49 
Elementary difficulty : 218 

Percent of Postgraduate difficulty 0 
Percent of Graduate difficulty 0 
Percent of High School difficulty 10 



Difficulty is easy. 



TANGIBILITY 



Number of Tangible words : 
Number of Intangible words : 

Tangibility is tangibible. 



76 

194 
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EMOTIONAL CONNOTATIONS 



Sublime connotations : 2 
Pleasant connotations : 79 
Neutral connotations : 170 
Unpleasant connotations : 13 
Horrid connotations : 6 



Percent of sublime connotations 0 
Percent of pleasant connotations 17 
Percent of unpleasant connotations 2 
Percent of horrid connotations 1 

Index of Emotionality A3 



Emotionality is Rich 
Tone is Positive 



VIGOR OF WORDS 



Words 


of 


Extreme Vigor 
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Words 


of 


Much 


Vigor 
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Some 
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Index of Vigor 
Vigor is very strong 



67 



0 

8 

27 



RECOMMENDATION NUMBER 1 



You tend to 
Check your 
Break them 



write run-on sentences, 
longest sentences for run 
up into units of single i 



ons . 
deas . 
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